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PLANT DEVELOPMENTAL GENES 



RELATED APPLICATION INFORMATION 

The present invention claims the benefit from US Provisional Patent Application Serial 

Nos. 60/166,228 filed November 17, 1999 and 60/197,899 filed April 17, 2000 and "Plant Trait 
Modification III" filed August 22, 2000. 

FIELD OF THE INVENTION 

This invention relates to the field of plant biology. More particularly, the present 

invention pertains to compositions and methods for phenotypically modifying a plant. 

BACKGROUND OF THE INVENTION 

Transcription factors can modulate gene expression, either increasing or 

decreasing (inducing or repressing) the rate of transcription. This modulation results in 
differential levels of gene expression at various developmental stages, in different tissues and cell 
types, and in response to different exogenous (e.g., environmental) and endogenous stimuli 
throughout the life cycle of the organism. 

■ Because transcription factors are key controlling elements of biological 
pathways, altering the expression levels of one or more transcription factors can change entire 
biological pathways in an organism. For example, manipulation of the levels of selected 
transcription factors may result in increased expression of economically useful proteins or 
metabolic chemicals in plants or to improve other agriculturally relevant characteristics. 
Conversely, blocked or reduced expression of a transcription factor may reduce biosynthesis of 
unwanted compounds or remove an undesirable trait. Therefore, manipulating transcription 
factor levels in a plant offers tremendous potential in agricultural biotechnology for modifying a 
plant's traits. 

The present invention provides novel transcription factors useful for modifying a 
plant's phenotype in desirable ways, such as modifying a plant's structure or development. 

SUMMARY OF THE INVENTION 

In a first aspect, the invention relates to a recombinant polynucleotide comprising 

a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding a 
polypeptide comprising a sequence selected from SEQ ID Nos. 2N, where N=l-23, or a 
complementary nucleotide sequence thereof; (b) a nucleotide sequence encoding a polypeptide 
comprising a conservatively substituted variant of a polypeptide of (a); (c) a nucleotide sequence 
comprising a sequence selected from those of SEQ ID Nos. 2N-1, where N=l-23, or a 
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complementary nucleotide sequence thereof; (d) a nucleotide sequence comprising silent 
substitutions in a nucleotide sequence of (c); (e) a nucleotide sequence which hybridizes under 
stringent conditions over substantially the entire length of a nucleotide sequence of one or more 
of: (a), (b), (c), or (d); (f) a nucleotide sequence comprising at least 15 consecutive nucleotides of 
5 a sequence of any of (a)-(e); (g) a nucleotide sequence comprising a subsequence or fragment of 
any of (a)-(f), which subsequence or fragment encodes a polypeptide having a biological activity 
that modifies a plant's structure and development characteristics; (h) a nucleotide sequence 
having at least 31% sequence identity to a nucleotide sequence of any of (a)-(g); (i) a nucleotide 
sequence having at least 60% identity sequence identity to a nucleotide sequence of any of (a)- 

10 (g)i 0) a nucleotide sequence which encodes a polypeptide having at least 3 1 % identity sequence 
identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; (k) a nucleotide sequence which 
encodes a polypeptide having at least 60% identity sequence identity to a polypeptide of SEQ ID 
Nos. 2N, where N=l-23; and (1) a nucleotide sequence which encodes a conserved domain of a 
polypeptide having at least 65% sequence identity to a conserved domain of a polypeptide of 

1 5 SEQ ID Nos. 2N, where N-l -23. The recombinant polynucleotide may further comprise a 

constitutive, inducible, or tissue-active promoter operably linked to the nucleotide sequence. The 
invention also relates to compositions comprising at least two of the above described 
polynucleotides. 

In a second aspect, the invention is an isolated or recombinant polypeptide 
20 comprising a subsequence of at least about 15 contiguous amino acids encoded by the 
recombinant or isolated polynucleotide described above. 

In another aspect, the invention is a transgenic plant comprising one or more of 
the above described recombinant polynucleotides. In yet another aspect, the invention is a plant 
with altered expression levels of a polynucleotide described above or a plant with altered 
25 expression or activity levels of an above described polypeptide. Further, the invention is a plant 
lacking a nucleotide sequence encoding a polypeptide described above. The plant may be a 
soybean, wheat, com, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, 
cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
30 pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous fruits, or 
vegetable brassicas plant. 

In a further aspect, the invention relates to a cloning or expression vector 
comprising the isolated or recombinant polynucleotide described above or cells comprising the 
cloning or expression vector. 
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In yet a further aspect, the invention relates to a composition produced by 
incubating a polynucleotide of the invention with a nuclease, a restriction enzyme, a polymerase; 
a polymerase and a primer; a cloning vector, or with a cell. 

Furthermore, the invention relates to a method for producing a plant having 
5 modified structure and development traits. The method comprises altering the expression of an 
isolated or recombinant polynucleotide of the invention or altering the expression or activity of a 
polypeptide of the invention in a plant to produce a modified plant, and selecting the modified 
plant for modified structure and development traits. 

In another aspect, the invention relates to a method of identifying a factor that is 
10 modulated by or interacts with a polypeptide encoded by a polynucleotide of the invention. The 
method comprises expressing a polypeptide encoded by the polynucleotide in a plant; and 
identifying at least one factor that is modulated by or interacts with the polypeptide. In one 
embodiment the method for identifying modulating or interacting factors is by detecting binding 
by the polypeptide to a promoter sequence, or by detecting interactions between an additional 
1 5 protein and the polypeptide in a yeast two hybrid system, or by detecting expression of a factor by 
hybridization to a microarray, subtractive hybridization or differential display. 

In yet another aspect, the invention is a method of identifying a molecule that 
modulates activity or expression of a polynucleotide or polypeptide of interest. The method 
comprises placing the molecule in contact with a plant comprising the polynucleotide or 
20 polypeptide encoded by the polynucleotide of the invention and monitoring one or more of the 
expression level of the polynucleotide in the plant, the expression level of the polypeptide in the 
plant, and modulation of an activity of the polypeptide in the plant. 

In yet another aspect, the invention relates to an integrated system, computer or ' 
computer readable medium comprising one or more character strings corresponding to a 
25 polynucleotide of the invention, or to a polypeptide encoded by the polynucleotide. The 

integrated system, computer or computer readable medium may comprise a link between one or 
more sequence strings to a modified plant structure and development trait. 

In yet another aspect, the invention is a method for identifying a sequence similar 
or homologous to one or more polynucleotides of the invention, or one or more polypeptides 
30 encoded by the polynucleotides. The method comprises providing a sequence database; and, 
querying the sequence database with one or more target sequences corresponding to the one or 
more polynucleotides or to the one or more polypeptides to identify one or more sequence 
members of the database that display sequence similarity or homology to one or more of the one 
or more target sequences. 
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The method may further comprise of linking the one or more of the 
polynucleotides of the invention, or encoded polypeptides, to a modified plant structure and 
development characteristics phenotype. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Figure 1 provides a table of exemplary polynucleotide and polypeptide sequences of the 

invention. The table includes from left to right for each sequence: the SEQ ED No., the internal 
code reference number (GID), whether the sequence is a polynucleotide or polypeptide sequence, 
and identification of any conserved domains for the polypeptide sequences. 

Figure 2 provides a table of exemplary sequences that are homologous to other sequences 
10 provided in the Sequence Listing and that are derived from Arabidopsis thaliana. The table 
includes from left to right: the SEQ ID No., the internal code reference number (GID), 
identification of the homologous sequence, whether the sequence is a polynucleotide or 
polypeptide sequence, and identification of any conserved domains for the polypeptide 
sequences. 

15 Figure 3 provides a table of exemplary sequences that are homologous to the sequences 

provided in Figures 1 and 2 and that are derived from plants other than Arabidopsis thaliana. The 
table includes from left to right: the SEQ ID No., the internal code reference number (GID), the 
unique GenBank sequence ID No. (NID), the probability that the comparison was generated by 
chance (P-value), and the species from which the homologous gene was identified. 

20 



DETAILED DESCRIPTION 

The present invention relates to polynucleotides and polypeptides, e.g. for 

modifying phenotypes of plants. 

In particular, the polynucleotides or polypeptides are useful for modifying traits 

25 associated with a plants structure or development characteristics when the expression levels of 
the polynucleotides or expression levels or activity levels of the polypeptides are altered. 
Specifically, the polynucleotides and polypeptides are useful for modifying the structure and size 
of flowers, leaves, roots, the plant as a whole, or the like, apical dominance, branching patterns, 
number of organs, organ identity, whether a plant is sterile or not, the vascularization of a plant, 

30 or the developmental staging of a plant, such as when senescence is triggered. 

The polynucleotides of the invention encode plant transcription factors. The plant 
transcription factors are derived, e.g., from Arabidopsis thaliana and can belong, e.g., to one or 
more of the following transcription factor families: the AP2 (APETALA2) domain transcription 
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factor family (Riechmann and Meyerowitz (1998) J. Biol, Chem. 379:633-646); the MYB 
transcription factor family (Martin and Paz- Ares (1997) Trends Genet. 13:67-73); the MADS 
domain transcription factor family (Riechmann and Meyerowitz (1997) J. Biol. Chem. 378:1079- 
1 101); the WRKY protein family (Ishiguro and Nakamura (1994) Mol. Gen. Genet. 244:563- 
5 571); the ankyrin-repeat protein family (Zhang et al. (1992) Plant Cell 4: 1575-1588); the 

miscellaneous protein (MISC) family (Kim et al. (1997) Plant J.' 1 1: 1237-1251); the zinc finger 
protein (Z) family (Klug and Schwabe (1995) FASEB J. 9: 597-604); the homeobox (HB) protein 
family (Duboule (1994) Guidebook to the Homeobox Genes. Oxford University Press); the 
CAAT-element binding proteins (Forsburg and Guarente (1989) Genes Dev. 3:1 166-1 178); the 

10 squamosa promoter binding proteins (SPB) (Klein et al. (1996) Mol. Gen. Genet. 1996 250:7-16); 
the NAM protein family; the IAA/AUX proteins (Rouse et al. (1998) Science 279:1371-1373); 
the HLH/MYC protein family (Littlewood et al. (1994) Prot. Profile 1:639-709); the DNA- 
binding protein (DBP) family (Tucker et al. (1994) EMBQ J. 13:2994-3002); the bZIP family of 
transcription factors (Foster et al. (1994) FASEB J. 8:192-200); the BPF-1 protein (Box P- 

15 binding factor) family (da Costa e Silva et al. (1993) Plant J. 4:125-135); and the golden protein 
(GLD) family (Hall et al. (1998) Plant Cell 10:925-936). * 

In addition to methods for modifying a plant phenotype by employing one or 
more polynucleotides and polypeptides of the invention described herein, the polynucleotides 
and polypeptides of the invention have a variety of additional uses. These uses include their use 

20 in the recombinant production (i.e, expression) of proteins; as regulators of plant gene expression, 
as diagnostic probes for the presence of complementary or partially complementary nucleic acids 
(including for detection of natural coding nucleic acids); as substrates for further reactions, e.g., 
mutation reactions, PCR reactions, or the like, of as substrates for cloning e.g., including 
digestion or ligation reactions, and for identifying exogenous or endogenous modulators of the 

25 transcription factors. 

DEFINITIONS 

A "polynucleotide" is a nucleic acid sequence comprising a plurality of 
polymerized nucleotide residues, e.g., at least about 15 consecutive polymerized nucleotide 
residues, optionally at least about 30 consecutive nucleotides, at least about 50 consecutive 
30 nucleotides. In many instances, a polynucleotide comprises a nucleotide sequence encoding a 
polypeptide (or protein) or a domain or fragment thereof. Additionally, the polynucleotide may 
comprise a promoter, an intron, an enhancer region, a polyadenylation site, a translation initiation 
site, 5' or 3* untranslated regions, a reporter gene, a selectable marker, or the like. The 
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polynucleotide can be single stranded or double stranded DNA or RNA. The polynucleotide 
optionally comprises modified bases or a modified backbone. The polynucleotide can be, e.g., 
genomic DNA or RNA, a transcript (such as an mRNA), a cDNA, a PCR product, a cloned DNA, 
a synthetic DNA or RNA, or the like. The polynucleotide can comprise a sequence in either 
5 sense or antisense orientations. 

A "recombinant polynucleotide" is a polynucleotide that is not in its native state, 
e.g., the polynucleotide comprises a nucleotide sequence not found in nature, or the 
polynucleotide is in a context other than that in which it is naturally found, e.g., separated from 
nucleotide sequences with which it typically is in proximity in nature, or adjacent (or contiguous 
10 with) nucleotide sequences with which it typically is not in proximity. For example, the sequence 
at issue can be cloned into a vector, or otherwise recombined with one or more additional nucleic 
acid. 

An "isolated polynucleotide" is a polynucleotide whether naturally occurring or 
recombinant, that is present outside the cell in which it is typically found in nature, whether 

15 purified or not. Optionally, an isolated polynucleotide is subject to one or more enrichment or 
purification procedures, e.g., cell lysis, extraction, centrifiigation, precipitation, or the like. 

A "recombinant polypeptide" is a polypeptide produced by translation of a 
recombinant polynucleotide. An "isolated polypeptide," whether a naturally occurring or a 
recombinant polypeptide, is more enriched in (or out of) a cell than the polypeptide in its natural 

20 state in a wild type cell, e.g., more than about 5% enriched, more than about 10% enriched, or 
more than about 20%, or more than about 50%, or more, enriched, i.e., alternatively denoted: 
105%, 110%, 120%, 150% or more, enriched relative to wild type standardized at 100%. Such an 
enrichment is not the result of a natural response of a wild type plant. Alternatively, or 
additionally, the isolated polypeptide is separated from other cellular components with which it is 

25 typically associated, e.g., by any of the various protein purification methods herein. 

The term "transgenic plant" refers to a plant that contains genetic material, not 
found in a wild type plant of the same species, variety or cultivar. The genetic material may 
include a transgene, an insertional mutagenesis event (such as by transposon or T-DNA 
insertional mutagenesis), an activation tagging sequence, a mutated sequence, a homologous 

30 recombination event or a sequence modified by chimeraplasty. Typically, the foreign genetic 
material has been introduced into the plant by human manipulation. 

A transgenic plant may contain an expression vector or cassette. The expression 
cassette typically comprises a polypep tide-encoding sequence operably linked (i.e., under 
regulatory control of) to appropriate inducible or constitutive regulatory sequences that allow for 
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the expression of polypeptide. The expression cassette can be introduced into a plant by 
transformation or by breeding after transformation of a parent plant. A plant refers to a whole 
plant as well as to a plant part, such as seed, fruit, leaf, or root, plant tissue, plant cells or any 
other plant material, e.g., a plant explant, as well as to progeny thereof, and. to in vitro systems 
5 that mimic biochemical or cellular components or processes in a cell. 

The phrase "ectopically expression or altered expression" in reference to a 
polynucleotide indicates that the pattern of expression in, e.g., a transgenic plant or plant tissue, is 
different from the expression pattern in a wild type plant or a reference plant of the same species. 
For example, the polynucleotide or polypeptide is expressed in a cell or tissue type other than a 

1 0 cell or tissue type in which the sequence is expressed in the wild type plant, or by expression at a 
time other than at the time the sequence is expressed in the wild type plant, or by a response to 
different inducible agents, such as hormones or environmental signals, or at different expression 
levels (either higher or lower) compared with those found in a wild type plant. The term also 
refers to altered expression patterns that are produced by lowering the levels of expression to 

15 below the detection level or completely abolishing expression. The resulting expression pattern 
can be transient or stable, constitutive or inducible. In reference to a polypeptide, the term 
"ectopic expression or altered expression" further may relate to altered activity levels resulting 
from the interactions of the polypeptides with exogenous or endogenous modulators or from 
interactions with factors or as a result of the chemical modification of the polypeptides. 

20 The term "fragment" or "domain," with respect to a polypeptide, refers to a 

subsequence of the polypeptide. In some cases, the fragment or domain, is a subsequence of the 
polypeptide which performs at least one biological function of the intact polypeptide in 
substantially the same manner, or to a similar extent, as does the intact polypeptide. For example, 
a polypeptide fragment can comprise a recognizable structural motif or functional domain such as 

25 a DNA binding domain that binds to a DNA promoter region, an activation domain or a domain 
for protein-protein interactions. Fragments can vary in size from as few as 6 amino acids to the 
full length of the intact polypeptide, but are preferably at least about 30 amino acids in length and 
more preferably at least about 60 amino acids in length. In reference to a nucleotide sequence, "a 
fragment" refers to any subsequence of a polynucleotide, typically, of at least consecutive about 

30 15 nucleotides, preferably at least about 30 nucleotides, more preferably at least about 50, of any 
of the sequences provided herein. 

The term "trait" refers to a physiological, morphological, biochemical or physical 
characteristic of a plant or particular plant material or cell. In some instances, this characteristic 
is visible to the human eye, such as seed or plant size, or can be measured by available 
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biochemical techniques, such as the protein, starch or oil content of seed or leaves or by the 
observation of the expression level of genes, e.g., by employing Northern analysis, RT-PCR, 
microarray gene expression assays or reporter gene expression systems, or by agricultural 
observations such as stress tolerance, yield or pathogen tolerance. 

"Trait modification" refers to a detectable difference in a characteristic in a plant 
ectopically expressing a polynucleotide or polypeptide of the present invention relative to a plant 
not doing so, such as a wild type plant. In some cases, the trait modification can be evaluated 
quantitatively. For example, the trait modification can entail at least about a 2% increase or 
decrease in an observed trait (difference), at least a 5% difference, at least about a 10% 
difference, at least about a 20% difference, at least about a 30%, at least about a 50%, at least 
about a 70%, or at least about a 100%, or an even greater difference. It is known that there can be 
a natural variation in the modified trait. Therefore, the trait modification observed entails a 
change of the normal distribution of the trait in the plants compared with the distribution 
observed in wild type plant. 

Trait modifications of particular interest include those to seed ( such as embryo 
or endosperm), fruit, root, flower, leaf, stem, shoot, seedling or the like, including: enhanced 
tolerance to environmental conditions including freezing, chilling, heat, drought, water saturation, 
radiation and ozone; improved tolerance to microbial, fungal or viral diseases; improved 
tolerance to pest infestations, including nematodes, mollicutes, parasitic higher plants or the like; 
decreased herbicide sensitivity; improved tolerance of heavy metals or enhanced ability to take up 
heavy metals; improved growth under poor photoconditions (e.g., low light and/or short day 
length), or changes in expression levels of genes of interest. Other phenotype that can be 
modified relate to the production of plant metabolites, such as variations in the production of 
taxol, tocopherol, tocotrienol, sterols, phytosterols, vitamins, wax monomers, anti-oxidants, 
amino acids, lignins, cellulose, tannins, prenyllipids (such as chlorophylls and carotenoids), 
glucosinolates, and terpenoids, enhanced or compositional ly altered protein or oil production 
(especially in seeds), or modified sugar (insoluble or soluble) and/or starch composition. 
Physical plant characteristics that can be modified include cell development (such as the number 
of trichomes), fruit and seed size and number, yields of plant parts such as stems, leaves and 
roots, the stability of the seeds during storage, characteristics of the seed pod (e.g., susceptibility 
to shattering), root hair length and quantity, internode distances, or the quality of seed coat. Plant 
growth characteristics that can be modified include growth rate, germination rate of seeds, vigor 
of plants and seedlings, leaf and flower senescence, male sterility, apomixis, flowering time, 
flower abscission, rate of nitrogen uptake, biomass or transpiration characteristics, as well as 
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plant architecture characteristics such as apical dominance, branching patterns, number of organs, 
organ identity, organ shape or size. 

POLYPEPTIDES AND POLYNUCLEOTIDES OF THE INVENTION 

The present invention provides, among other things, transcription factors (TFs), 
5 and transcription factor homologue polypeptides, and isolated or recombinant polynucleotides 
encoding the polypeptides. These polypeptides and polynucleotides may be employed to modify 
a plant's structure and development characteristics. 

Exemplary polynucleotides encoding the polypeptides of the invention were 
identified in the Arabidopsis thaliana GenBank database using publicly available sequence 
10 analysis programs and parameters. Sequences initially identified were then further characterized 
to identify. sequences comprising specified sequence strings corresponding to sequence motifs 
present in families of known transcription factors. Polynucleotide sequences meeting such 
criteria were confirmed as transcription factors. 

Additional polynucleotides of the invention were identified by screening 
15 Arabidopsis thaliana and/or other plant cDNA libraries with probes corresponding to known 
transcription factors under low stringency hybridization conditions. Additional sequences, 
including full length coding sequences were subsequently recovered by the rapid amplification of 
cDNA ends (RACE) procedure, using a commercially available kit according to the 
manufacturer's instructions. Where necessary, multiple rounds of RACE are performed to isolate 
20 5' and 3' ends. The full length cDNA was then recovered by a routine end-to-end polymerase 
chain reaction (PCR) using primers specific to the isolated 5' and 3' ends. Exemplary sequences 
are provided in the Sequence Listing. 

The polynucleotides of the invention were ectopically expressed in overexpressor 
or knockout plants and changes in the structure and development characteristics of the plants 
25 were observed. Therefore, the polynucleotides and polypeptides can be employed to improve the 
structure and development characteristics of plants. 

Making polynucleotides 

The polynucleotides of the invention include sequences that encode transcription 
factors and transcription factor homologue polypeptides and sequences complementary thereto, as 
30 well as unique fragments of coding sequence, or sequence complementary thereto. Such 

polynucleotides can be, e.g., DNA or RNA, e.g., mRNA, cRNA, synthetic RNA, genomic DNA, 
cDNA synthetic DNA, oligonucleotides, etc. The polynucleotides are either double-stranded or 
single-stranded, and include either, or both sense (i.e., coding) sequences and antisense (i.e., non- 
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coding, complementary) sequences. The polynucleotides include the coding sequence of a 
transcription factor, or transcription factor homologue polypeptide, in isolation, in combination 
with additional coding sequences (e.g., a purification tag, a localization signal, as a fusion- 
protein, as a pre-protein, or the like), in combination with non-coding sequences (e.g., introns or 
inteins, regulatory elements such as promoters, enhancers, terminators, and the like), and/or in a 
vector or host environment in which the polynucleotide encoding a transcription factor or 
transcription factor homologue polypeptide is an endogenous or exogenous gene. 

A variety of methods exist for producing the polynucleotides of the invention. 
Procedures for identifying and isolating DNA clones are well known to those of skill in the art, 
and are described in, e.g., Berger and Kimmel, Guide to Molecular Cloning Techniques. Methods 
in Enzymology volume 152 Academic Press, Inc., San Diego, CA ("Berger"); Sambrook et al., 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York, 1989 ("Sambrook") and Current Protocols in Molecular Biology. 
F.M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing 
Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 2000) ("Ausubel"). 

Alternatively, polynucleotides of the invention, can be produced by a variety of 
in vitro amplification methods adapted to the present invention by appropriate selection of 
specific or degenerate primers. Examples of protocols sufficient to direct persons of skill through 
in vitro amplification methods, including the polymerase chain reaction (PCR) the ligase chain 
reaction (LCR), Qbeta-replicase amplification and other RNA polymerase mediated techniques 
(e.g., NASBA), e.g., for the production of the homologous nucleic acids of the invention are 
found in Berger, Sambrook, and Ausubel, as well as Mullis et al., (1987) PCR Protocols A Guide 
to Methods and Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis). 
Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., 
U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are 
summarized in Cheng et al. (1994) Nature 369: 684-685 and the references cited therein, in which 
PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any 
RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR 
expansion and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, 
Sambrook and Berger, all supra. 

Alternatively, polynucleotides and oligonucleotides of the invention can be 
assembled from fragments produced by solid-phase synthesis methods. Typically, fragments of 
up to approximately 100 bases are individually synthesized and then enzyniatically or chemically 
ligated to produce a desired sequence, e.g., a polynucleotide encoding ail or part of a 
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transcription factor. For example, chemical synthesis using the phosphoramidite method is 
described, e.g., by Beaucage et al. (1981) Tetrahedron Letters 22:1859-69; and Matthes et al. 
(1984) EMBO J. 3:801-5. According to such methods, oligonucleotides are synthesized, purified, 
annealed to their complementary strand, ligated and then optionally cloned into suitable vectors. 
5 And if so desired, the polynucleotides and polypeptides of the invention can be custom ordered 
from any of a number of commercial suppliers. 

HOMOLOGOUS SEQUENCES 

Sequences homologous, i.e., that share significant sequence identity or similarity, 
to those provided in the Sequence Listing, derived from Arabidopsis thaliana or from other plants 

10 of choice are also an aspect of the invention. Homologous sequences can be derived from any 
plant including monocots and dicots and in particular agriculturally important plant species, 
including but not limited to, crops such as soybean, wheat, corn, potato, cotton, rice, oilseed rape 
(including canola), sunflower, alfalfa, sugarcane and turf; or fruits and vegetables, such as 
banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, 

15 cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
pineapple, spinach, squash, sweet com, tobacco, tomato, watermelon, rosaceous fruits (such as 
apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, 
cauliflower, brussel sprouts and kohlrabi). Other crops, fruits and vegetables whose phenotype 
can be changed include barley, rye, millet, sorghum, currant, avocado, citrus fruits such as 

20 oranges, lemons, grapefruit and tangerines, artichoke, cherries, nuts such as the walnut and 
peanut, endive, leek, roots, such as arrowroot, beet, cassava, turnip, radish, yam, and sweet 
potato, and beans. The homologous sequences may also be derived from woody species, such 
pine, poplar and eucalyptus. 

Transcription factors that are homologous to the listed sequences will typically 

25 share at least about 30% amino acid sequence identity. More closely related transcription factors 
can share at least about 50%, about 60%, about 65%, about 70%, about 75% or about 80% or 
about 90% or about 95% or about 98% or more sequence identity with the listed sequences. 
Factors that are most closely related to the listed sequences share, e.g., at least about 85%, about 
90% or about 95% or more % sequence identity to the listed sequences. At the nucleotide level, 

30 the sequences will typically share at least about 40% nucleotide sequence identity, preferably at 
least about 50%, about 60%, about 70% or about 80% sequence identity, and more preferably 
about 85%, about 90%, about 95% or about 97% or more sequence identity to one or more of the 
listed sequences. The degeneracy of the genetic code enables major variations in the nucleotide 

11 



WO 01/36444 



PCT/US00/31325 



sequence of a polynucleotide while maintaining the amino acid sequence of the encoded protein. 
Conserved domains within a transcription factor family may exhibit a higher degree of sequence 
homology, such as at least 65% sequence identity including conservative substitutions, and 
preferably at least 80% sequence identity. 

Identifying Nucleic Acids bv Hybridization 

Polynucleotides homologous to the sequences illustrated in the Sequence Listing 
can be identified, e.g., by hybridization to each other under stringent or under highly stringent 
conditions. Single stranded polynucleotides hybridize when they associate based on a variety of 
well characterized physico-chemical forces, such as hydrogen bonding, solvent exclusion, base 
stacking and the like. The stringency of a hybridization reflects the degree of sequence identity 
of the nucleic acids involved, such that the higher the stringency, the more similar are the two 
polynucleotide strands. Stringency is influenced by a variety of factors, including temperature, 
salt concentration and composition, organic and non-organic additives, solvents, etc. present in 
both the hybridization and wash solutions and incubations (and number), as described in more 
detail in the references cited above. 

An example of stringent hybridization conditions for hybridization of 
complementary nucleic acids which have more than 100 complementary residues on a filter in a 
Southern or northern blot is about 5°C to 20°C lower than the thermal melting point (Tm) for the 
specific sequence at a defined ionic strength and pH. The T m is the temperature (under defined 
ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched 
probe. Nucleic acid molecules that hybridize under stringent conditions will typically hybridize 
to a probe based on either the entire cDNA or selected portions, e.g., to a unique subsequence, of 

the cDNA under wash conditions of 0.2x SSC to 2.0 x SSC, 0.1% SDS at 50-65° C, for example 

0.2 x SSC, 0.1% SDS at 65° C. For identification of less closely related homologues washes can 

be performed at a lower temperature, e.g., 50° C. In general, stringency is increased by raising 
the wash temperature and/or decreasing the concentration of SSC. 

As another example, stringent conditions can be selected such that an 
oligonucleotide that is perfectly complementary to the coding oligonucleotide hybridizes to the 
coding oligonucleotide with at least about a 5-1 Ox higher signal to noise ratio than the ratio for 
hybridization of the perfectly complementary oligonucleotide to a nucleic acid encoding a 
transcription factor known as of the filing date of the application. Conditions can be selected 
such that a higher signal to noise ratio is observed in the particular assay which is used, e.g., 
about 15x, 25x, 35x, 50x or more. Accordingly, the subject nucleic acid hybridizes to the unique 
coding oligonucleotide with at least a 2x higher signal to noise ratio as compared to hybridization 
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of the coding oligonucleotide to a nucleic acid encoding known polypeptide. Again, higher 
signal to noise ratios can be selected, e.g., about 5x, lOx, 25x, 35x, 50x or more. The particular 
signal will depend on the label used in the relevant assay, e.g., a fluorescent label, a colorimetric 
label, a radio active label, or the like. 

Alternatively, transcription factor homologue polypeptides can be obtained by 
screening an expression library using antibodies specific for one or more transcription factors. 
With the provision herein of the disclosed transcription factor, and transcription factor homologue 
nucleic acid sequences, the encoded polypeptide(s) can be expressed and purified in a 
heterologous expression system (e.g., E. coli) and used to raise antibodies (monoclonal or 
polyclonal) specific for the polypeptide(s) in question. Antibodies can also be raised against 
synthetic peptides derived from transcription factor, or transcription factor homologue, amino 
acid sequences. Methods of raising antibodies are well known in the art and are described in 
Harlow and Lane (1988) Antibodies: A Laboratory Manual . Cold Spring Harbor Laboratory, New 
York. Such antibodies can then be used to screen an expression library produced from the plant 
from which it is desired to clone additional transcription factor homologues, using the methods 
described above. The selected cDNAs can be confirmed by sequencing and enzymatic activity. 

SEQUENCE VARIATIONS 

It will readily be appreciated by those of skill in the art, that any of a variety of 
polynucleotide sequences are capable of encoding the transcription factors and transcription 
factor homologue polypeptides of the invention. Due to the degeneracy of the genetic code, 
many different polynucleotides can encode identical and/or substantially similar polypeptides in 
addition to those sequences illustrated in the Sequence Listing. 

For example, Table 1 illustrates, e.g., that the codons AGC, AGT, TCA, TCC, 
TCG, and TCT all encode the same amino acid: serine. Accordingly, at each position in the 
sequence where there is a codon encoding serine, any of the above trinucleotide sequences can be 
used without altering the encoded polypeptide. 
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Table 1 



Amino acids 


Codon 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


TGC 


TGT 


• 








Aspartic acid 


Asp 


D 


GAC 


GAT 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


TTC 


TTT 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGT 






Histidine 


His 


H 


CAC 


CAT 










Isoleucine 


He 


I 


ATA 


ATC 


ATT 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine . 


Leu. 


L 


TTA 


TTG 


CTA 


CTC 


CTG 


CTT 


Methionine 


Met 


M 


ATG 












Asparagine 


Asn 


N 


AAC 


AAT 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


CCT 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGT 


Serine 


Ser 


S 


AGC 


AGT 


TCA 


TCC 


TCG 


TCT 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACT 






Valine 


Val 


V 


GTA 


GTC 


GTG 


GTT 






Tryptophan 


Trp 


W 


TGG . 












Tyrosine 


Tyr 


Y 


TAC 


TAT 











Sequence alterations that do not change the amino acid sequence encoded by the 
polynucleotide are termed "silent" variations. With the exception of the codons ATG and TGG, 
encoding methionine and tryptophan, respectively, any of the possible codons for the same amino 
acid can be substituted by a variety of techniques, e.g., site-directed mutagenesis, available in the 
art. Accordingly, any and all such variations of a sequence selected from the above table are a 
feature of the invention. 

In addition to silent variations, other conservative variations that alter one, or a 
few amino acids in the encoded polypeptide, can be made without altering the function of the 
polypeptide, these conservative variants are, likewise, a feature of the invention. 

For example, substitutions, deletions and insertions introduced into the sequences 
provided in the Sequence Listing are also envisioned by the invention. Such sequence 
modifications can be engineered into a sequence by site-directed mutagenesis (Wu (ed.) Meth. 
Enzvmol . (1993) vol. 217, Academic Press) or the other methods noted below. Amino acid 
substitutions are typically of single residues; insertions usually will be on the order of about from 
1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues. In preferred 
embodiments, deletions or insertions are made in adjacent pairs, e.g., a deletion of two residues or 
insertion of two residues. Substitutions, deletions, insertions or any combination thereof can be 
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combined to arrive at a sequence. The mutations that are made in the polynucleotide encoding the 
transcription factor should not place the sequence out of reading frame and should not create 
complementary regions that could produce secondary mRNA structure. Preferably, the 
polypeptide encoded by the DNA performs the desired function. 
5 Conservative substitutions are those in which at least one residue in the amino 

acid sequence has been removed and a different residue inserted in its place. Such substitutions 
generally are made in accordance with the Table 2 when it is desired to maintain the activity of 
the protein. Table 2 shows amino acids which can be substituted for an amino acid in a protein 
and which are typically regarded as conservative substitutions. 
10 Table 2 



U AC 1 /ill A 


pAncoi'v'itSvo CiihcHfufiAtic 

^unaervdiivc ouuaiiiuiiuiiN 


Aia 


oer 


A TO 




A en 
noil 


niiv Hie 


Asp 


Glu 


Gin 


Asn 


Cys 


Ser 


Glu 


Asp 


Gly 


Pro 


His 


Asn; Gin 


He 


Leu, Val 


Leu 


Be; Val 


Lys 


Arg; Gin 


Met 


Leu; He 


Phe 


Met; Leu; Tyr 


Ser 


Thr; Gly 


Thr 


SerjVal 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Substitutions that are less conservative than those in Table 2 can be selected by 
picking residues that differ more significantly in their effect on maintaining (a) the structure of 
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the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of 
the side chain. The substitutions which in general are expected to produce the greatest changes in 
protein properties will be those in which (a) a hydrophilic residue, e.g., seryl or threonyl, is 
5 substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or alanyl; 
(b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. 

10 FURTHER MODIFYING SEQUENCES OF THE INVENTION— MUTATION/ FORCED 
EVOLUTION 

In addition to generating silent or conservative substitutions as noted, above, the 
present invention optionally includes methods of modifying the sequences of the Sequence 
Listing. In the methods, nucleic acid or protein modification methods are used to alter the given 

1 5 sequences to produce new sequences and/or to chemically or enzymatically modify given 
sequences to change the properties of the nucleic acids or proteins. 

Thus, in one embodiment, given nucleic acid sequences are modified, e.g., 
according to standard mutagenesis or artificial evolution methods to produce modified sequences. 
For example, Ausubel, supra, provides additional details on mutagenesis methods. Artificial 

20 forced evolution methods are described, e.g., by Stemmer (1994) Nature 370:389-391, and 
Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751. Many other mutation and 
evolution methods are also available and expected to be within the skill of the practitioner. 

Similarly, chemical or enzymatic alteration of expressed nucleic acids and 
polypeptides can be performed by standard methods. For example, sequence can be modified by 

25 addition of lipids, sugars, peptides, organic or inorganic compounds, by the inclusion of modified 
nucleotides or amino acids, or the like. For example, protein modification techniques are 
illustrated in Ausubel, supra. Further details on chemical and enzymatic modifications can be 
found herein. These modification methods can be used to modify any given sequence, or to 
modify any sequence produced by the various mutation and artificial evolution modification 

30 methods noted herein. 

Accordingly, the invention provides for modification of any given nucleic acid 
by mutation, evolution, chemical or enzymatic modification, or other available methods, as well 
as for the products produced by practicing such methods, e.g., using the sequences herein as a 
starting substrate for the various modification approaches. 
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For example, optimized coding sequence containing codons preferred by a 
particular prokaryotic or eukaryotic host can be used e.g., to increase the rate of translation or to 
produce recombinant RNA transcripts having desirable properties, such as a longer half-life, as 
compared with transcripts produced using a non-optimized sequence. Translation stop codons 
can also be modified to reflect host preference. For example, preferred stop codons for S. 
cerevisiae and mammals are TAA and TGA, respectively. The preferred stop codon for 
monocotyledonous plants is TGA, whereas insects and E. coli prefer to use TAA as the stop 
codon. 

The polynucleotide sequences of the present invention can also be engineered in 
order to alter a coding sequence for a variety of reasons, including but not limited to, alterations 
which modify the sequence to facilitate cloning, processing and/or expression of the gene 
product. For example, alterations are optionally introduced using techniques which are well 
known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter 
glycosylation patterns, to change codon preference, to introduce splice sites, etc. 

Furthermore, a fragment or domain derived from any of the polypeptides of the 
invention can be combined with domains derived from other transcription factors or synthetic 
domains to modify the biological activity of a transcription factor. For instance, a DNA binding 
domain derived from a transcription factor of the invention can be combined with the activation 
domain of another transcription factor or with a synthetic activation domain. A transcription 
activation domain assists in initiating transcription from a DNA binding site. Examples include 
the transcription activation region of VP16 or GAL4 (Moore et al. (1998) Proc. Natl. Acad. Sci. 
USA 95: 376-381; and Aoyarnaetal. (1995) PlantCeU 7: 1773-1785), peptides derived from 
bacterial sequences (Ma and Ptashne (1987) Cell 51; 1 13-119) and synthetic peptides (Giniger 
and Ptashne, (1987) Nature 330:670-672). 

EXPRESSION AND MODIFICATION OF POLYPEPTIDES 

Typically, polynucleotide sequences of the invention are incorporated into 
recombinant DNA (or RNA) molecules that direct expression of polypeptides of the invention in 
appropriate host cells, transgenic plants, in vitro translation systems, or the like. Due to the 
inherent degeneracy of the genetic code, nucleic acid sequences which encode substantially the 
same or a functionally equivalent amino acid sequence can be substituted for any listed sequence 
to provide for cloning and expressing the relevant homologue. 
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Vectors. Promoters and Expression Systems 

The present invention includes recombinant constructs comprising one or more 
of the nucleic acid sequences herein. The constructs typically comprise a vector, such as a 
plasmid, a cosmid, a phage, a virus (e.g., a plant virus), a bacteria! artificial chromosome (BAC), 
5 a yeast artificial chromosome (Y AC), or the like, into which a nucleic acid sequence of the 
invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are 
known to those of skill in the art, and are commercially available. 

10 General texts which describe molecular biological techniques useful herein, 

including the use and production of vectors, promoters and many other relevant topics, include 
Berger, Sambrook and Ausubel, supra. Any of the identified sequences can be incorporated into a 
cassette or vector, e.g., for expression in plants. A number of expression vectors suitable for stable 
transformation of plant ceils or for the establishment of transgenic plants have been described 

1 5 including those described in Weissbach and Weissbach, (1989) Methods for Plant Molecular 
Biology . Academic Press, and Gelvin et al., (1990) Plant Molecular Biology Manual . Kluwer 
Academic Publishers. Specific examples include those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella et al. (1983) Nature 
303: 209, Bevan (1984) Nucl Acid Res. 12: 8711-8721, Klee (1985) Bio/Technology 3: 637-642, 

20 for dicotyledonous plants. 

Alternatively, non-Ti vectors can be used to transfer the DNA into 
monocotyledonous plants and cells by using free DNA delivery techniques. Such methods can 
involve, for example, the use of liposomes, electroporation, microprojectile bombardment, silicon 
carbide whiskers, and viruses. By using these methods transgenic plants such as wheat, rice 

25 (Christou (1991) Bio/Technology 9: 957-962) and corn (Gordon-Kamm (1990) Plant Cell 2: 603- 
618) can be produced. An immature embryo can also be a good target tissue for monocots for 
direct DNA delivery techniques by using the particle gun (Weeks et al. (1993) Plant Physiol 102: 
1077-1084; Vasil (1993) Bio/Technology 10: 667-674; Wan and Lemeaux (1994) Plant Phvsiol 
104: 37-48, and for Agrobacterium-mediated DNA transfer (Ishida et al. (1996) Nature Biotech 

30 14: 745-750). 

Typically, plant transformation vectors include one or more cloned plant coding 
sequence (genomic or cDNA) under the transcriptional control of 5' and 3' regulatory sequences 
and a dominant selectable marker. Such plant transformation vectors typically also contain a 
promoter (e.g., a regulatory region controlling inducible or constitutive, environmentally-or 

18 



WO 01/36444 PCT/US00/31325 

developmentally-regulated, or cell- or tissue-specific expression), a transcription initiation start 
site, an RNA processing signal (such as intron splice sites), a transcription termination site, and/or 
a polyadenylation signal. 

Examples of constitutive plant promoters which can be useful for expressing the 
5 TF sequence include: the cauliflower mosaic virus (CaMV) 35S promoter, which confers 
constitutive, high-level expression in most plant tissues (see, e.g., Odel et al. (1985) Nature 
313:810); the nopaline synthase promoter (An et al. (1988) Plant Phvsiol 88:547); and the 
octopine synthase promoter (Fromm et al. (1989) Plant Cell 1: 977). 

A variety of plant gene promoters that regulate gene expression in response to 

10 environmental, hormonal, chemical, developmental signals, and in a tissue-active manner can be 
used for expression of a TF sequence in plants. Choice of a promoter is based largely on the 
phenotype of interest and is determined by such factors as tissue (e.g., seed, fruit, root, pollen, 
vascular tissue, flower, carpel, etc.), inducibility (e.g., in response to wounding, heat, cold, 
drought, light, pathogens, etc.), timing, developmental stage, and the like. Numerous known 

1 5 promoters have been characterized and can favorable be employed to promote expression of a 
polynucleotide of the invention in a transgenic plant or cell of interest. For example, tissue 
specific promoters include: seed-specific promoters (such as the napin, phaseolin or DC3 
promoter described in US Pat. No. 5,773,697), fruit-specific promoters that are active during fruit 
ripening (such as the dru 1 promoter (US Pat. No. 5,783,393), or the 2A1 1 promoter (US Pat. No. 

20 4,943,674) and the tomato polygalacturonase promoter (Bird et al. (1988) Plant Mol Biol 1 1:651), 
root-specific promoters, such as those disclosed in US Patent Nos. 5,618,988, 5,837,848 and 
5,905,186, pollen-active promoters such as PTA29, PTA26 and PTA13 (US Pat. No. 5,792,929), 
promoters active in vascular tissue (Ringli and Keller (1998) Plant Mol Biol 37:977-988), flower- 
specific (Kaiser et al, (1995) Plant Mol Biol 28:231-243), pollen (Baerson et al. (1994) Plant Mol 

25 Biol 26: 1947-1959), carpels (Ohl et al. (1990) Plant Cell 2:837-848), pollen and ovules (Baerson 
et al. (1993) Plant Mol Biol 22:255-267), auxin-inducible promoters (such as that described in 
van der Kop et al. (1999) Plant Mol Biol 39:979-990 or Baumann et al. (1999) Plant Cell 1 1:323- 
334), cytokinin-inducible promoter (Guevara-Garcia (1998) Plant Mol Biol 38:743-753), 
promoters responsive to gibberellin (Shi et al. (1 998) Plant Mol Biol 38: 1053-1060, Willmott et 

30 al. (1998) 38:817-825) and the like. Additional promoters are those that elicit expression in 
response to heat (Ainley et al. (1993) Plant Mol Biol 22: 13-23), light (e.g., the pea rbcS-3A 
promoter, Kuhlemeier et al. (1989) Plant Cell 1:471, and the maize rbcS promoter, Schaffher and 
Sheen (1991) Plant Cell 3: 997); wounding (e.g., wunl, Siebertz et al. (1989) Plant Cell 1: 961); 
pathogens (such as the PR-1 promoter described in Buchel et al. (1999) Plant Mol. Biol. 40:387- 
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396, and the PDF1.2 promoter described in Manners et al. (1998) Plant Mol. Biol. 38:1071-80), 
and chemicals such as methyl jasmonate or salicylic acid (Gatz et al. (1997) Plant Mol Biol 48: 89- 
108). In addition, the timing of the expression can be controlled by using promoters such as those 
acting at senescence (An and Amazon (1995) Science 270: 1986-1988); or late seed development 
5 (Odell et al. (1994) Plant Physiol 106:447-458). 

Plant expression vectors can also include RNA processing signals that can be 
positioned within, upstream or downstream of the coding sequence. In addition, the expression 
vectors can include additional regulatory sequences from the 3'-untranslated region of plant 
genes, e.g., a 3* terminator region to increase mRNA stability of the mRNA, such as the PI-II 
10 terminator region of potato or the octopine or nopaline synthase 3' terminator regions. 

Additional Expression Elements 

Specific initiation signals can aid in efficient translation of coding sequences. 
These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where 
a coding sequence, its initiation codon and upstream sequences are inserted into the appropriate 

15 expression vector, no additional translational control signals may be needed. However, in cases 
where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is 
inserted, exogenous transcriptional control signals including the ATG initiation codon can be 
separately provided. The initiation codon is provided in the correct reading frame to facilitate 
transcription. Exogenous transcriptional elements and initiation codons can be of various origins, 

20 both natural and synthetic. The efficiency of expression can be enhanced by the inclusion of 
enhancers appropriate to the cell system in use. 

Expression Hosts 

The present invention also relates to host cells which are transduced with vectors 
of the invention, and the production of polypeptides of the invention (including fragments 

25 thereof) by recombinant techniques. Host cells are genetically engineered (i.e, nucleic acids are 
introduced, e.g., transduced, transformed or transfected) with the vectors of this invention, which 
may be, for example, a cloning vector or an expression vector comprising the relevant nucleic 
acids herein. The vector is optionally a plasmid, a viral particle, a phage, a naked nucleic acids, 
etc. The engineered host cells can be cultured in conventional nutrient media modified as 

30 appropriate for activating promoters, selecting transformants, or amplifying the relevant gene. 
The culture conditions, such as temperature, pH and the like, are those previously used with the 
host cell selected for expression, and will be apparent to those skilled in the art and in the 
references cited herein, including, Sambrook and Ausubel. 
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The host cell can be a eukaryotic cell, such as a yeast cell, or a plant cell, or the 
host cell can be a prokaryotic cell, such as a bacterial cell. Plant protoplasts are also suitable for 
some applications. For example, the DNA fragments are introduced into plant tissues, cultured 
plant cells or plant protoplasts by standard methods including electroporation (Fromm et al. t 
(1985) Proc. Natl. Acad. Sci. USA 82, 5824, infection by viral vectors such as cauliflower mosaic 
virus (CaMV) (Hohn et al., (1982) Molecular Biology of Plant Tumors . (Academic Press, New 
York) pp. 549-560; US 4,407,956), high velocity ballistic penetration by small particles with the 
nucleic acid either within the matrix of small beads or particles, or on the surface (Klein et al., 
(1987) Nature 327. 70-73), use of pollen as vector (WO 85/01856), or use of Agrobacterium 
tumefaciens or A. rhizogenes carrying a T-DNA plasmid in which DNA fragments are cloned. 
The T-DNA plasmid is transmitted to plant cells upon infection by Agrobacterium tumefaciens, 
and a portion is stably integrated into the plant genome (Horsch et al. (1984) Science 233:496- 
498; Fraley et al. (1983) Proc. Natl. Acad. Sci. USA 80, 4803). 

The cell can include a nucleic acid of the invention which encodes a polypeptide, 
wherein the cells expresses a polypeptide of the invention. The cell can also include vector 
sequences, or the like. Furthermore, cells and transgenic plants which include any polypeptide or 
nucleic acid above or throughout this specification, e.g., produced by transduction of a vector of 
the invention, are an additional feature of the invention. 

For long-term, high-yield production of recombinant proteins, stable expression 
can be used. Host cells transformed with a nucleotide sequence encoding a polypeptide of the 
invention are optionally cultured under conditions suitable for the expression and recovery of the 
encoded protein from cell culture. The protein or fragment thereof produced by a recombinant 
cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides encoding mature proteins of the invention can be designed with signal 
sequences which direct secretion of the mature polypeptides through a prokaryotic or eukaryotic 
cell membrane. 

Modified Amino Acids 

Polypeptides of the invention may contain one or more modified amino acids. 
The presence of modified amino acids may be advantageous in, for example, increasing 
polypeptide half-life, reducing polypeptide antigenicity or toxicity, increasing polypeptide storage 
stability, or the like. Amino acid(s) are modified, for example, co-translationally or post- 
translationally during recombinant production or modified by synthetic or chemical means. 
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Non-limiting examples of a modified amino acid include incorporation or other 
use of acetylated amino acids, glycosylated amino acids, sulfated amino acids, prenylated (e.g., 
farnesylated, geranylgeranylated) amino acids, PEG modified (e.g., "PEGylated") amino acids, 
biotinylated amino acids, carboxylated amino acids, phosphorylated amino. acids, etc. References 
5 adequate to guide one of skill in the modification of amino acids are replete throughout the 
literature. 

IDENTIFICATIQN OF ADDITIONAL FACTORS 

A transcription factor provided by the present invention can also be used to 
identify additional endogenous or exogenous molecules that can affect a phentoype or trait of 
10 interest. On the one hand, such molecules include organic (small or large molecules) and/or 
inorganic compounds that affect expression of (i.e., regulate) a particular transcription factor. 
Alternatively, such molecules include endogenous molecules that are acted upon either at a 
transcriptional level by a transcription factor of the invention to modify a phenotype as desired. 
For example, the transcription factors can be employed to identify one or more downstream gene 

1 5 with which is subject to a regulatory effect of the transcription factor; In one approach, a 

transcription factor or transcription factor homologue of the invention is expressed in a host cell, 
e.g, a transgenic plant cell, tissue or explant, and expression products, either RNA or protein, of 
likely or random targets are monitored, e.g., by hybridization to a microarTay of nucleic acid 
probes corresponding to genes expressed in a tissue or cell type of interest, by two-dimensional 

20 gel electrophoresis of protein products, or by any other method known in the art for assessing 
expression of gene products at the level of RNA or protein. Alternatively, a transcription factor 
of the invention can be used to identify promoter sequences (i.e., binding sites) involved in the 
regulation of a downstream target. After identifying a promoter sequence, interactions between 
the transcription factor and the promoter sequence can be modified by changing specific 

25 nucleotides in the promoter sequence or specific amino acids in the transcription factor that 
interact with the promoter sequence to alter a plant trait. Typically, transcription factor DNA 
binding sites are identified by gel shift assays. After identifying the promoter regions, the 
promoter region sequences can be employed in double-stranded DNA arrays to identify 
molecules that affect the interactions of the transcription factors with their promoters (Bulyk et al. 

30 (1999) Nature Biotechnology 17:573-577). 

The identified transcription factors are also useful to identify proteins that modify 
the activity of the transcription factor. Such modification can occur by covalent modification, 
such as by phosphorylation, or by protein-protein (homo or-heteropolymer) interactions. Any 
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method suitable for detecting protein-protein interactions can be employed. Among the methods 
that can be employed are co-immunoprecipitation, cross-linking and co-purification through 
gradients or chromatographic columns, and the two-hybrid yeast system. 

The two-hybrid system detects protein interactions in vivo and is described in 
Chien, et al., (1991), Proc. Natl. Acad. Sci. USA 88, 9578-9582 and is commercially available 
from Clontech (Palo Alto, Calif.). In such a system, plasmids are constructed that encode two 
hybrid proteins: one consists of the DNA-binding domain of a transcription activator protein 
fused to the TF polypeptide and the other consists of the transcription activator protein's 
activation domain fused to an unknown protein that is encoded by a cDNA that has been 
recombined into the plasmid as part of a cDNA library. The DNA-binding domain fusion plasmid 
and the cDNA library are transformed into a strain of the yeast Saccharomyces cerevisiae that 
contains a reporter gene (e.g., lacZ) whose regulatory region contains the transcription activator's 
binding site. Either hybrid protein alone cannot activate transcription of the reporter gene. 
Interaction of the two hybrid proteins reconstitutes the functional activator protein and results in 
expression of the reporter gene, which is detected by an assay for the reporter gene product. Then, 
the library plasmids responsible for reporter gene expression are isolated and sequenced to 
identify the proteins encoded by the library plasmids. After identifying proteins that interact with 
the transcription factors, assays for compounds that interfere with the TF protein-protein 
interactions can be preformed. 

IDENTIFICATION OF MODULATORS 

In addition to the intracellular molecules described above, extracellular 
molecules that alter activity or expression of a transcription factor, either directly or indirectly, 
can be identified. For example, the methods can entail first placing a candidate molecule in 
contact with a plant or plant cell. The molecule can be introduced by topical administration, such 
as spraying or soaking of a plant, and then the molecule's effect on the expression or activity of 
the TF polypeptide or the expression of the polynucleotide monitored. Changes in the expression 
of the TF polypeptide can be monitored by use of polyclonal or monoclonal antibodies, gel 
electrophoresis or the like. Changes in the expression of the corresponding polynucleotide 
sequence can be detected by use of microarrays, Northerns, quantitative PCR, or any other 
technique for monitoring changes in mRNA expression. These techniques are exemplified in 
Ausubel et al. (eds) Current Protocols in Molecular Biology. John Wiley & Sons (1998). Such 
changes in the expression levels can be correlated with modified plant traits and thus identified 
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molecules can be useful for soaking or spraying on fruit, vegetable and grain crops to modify 
traits in plants. 

Essentially any available composition can be tested for modulatory activity of 
expression or activity of any nucleic acid or polypeptide herein. Thus, available libraries of 
compounds such as chemicals, polypeptides, nucleic acids and the like can be tested for 
modulatory activity. Often, potential modulator compounds can be dissolved in aqueous or 
organic (e.g., DMSO-based) solutions for easy delivery to the cell or plant of interest in which the 
activity of the modulator is to be tested. Optionally, the assays are designed to screen large 
modulator composition libraries by automating the assay steps and providing compounds from 
any convenient source to assays, which are typically run in parallel (e.g., in microtiter formats on 
microtiter plates in robotic assays). 

In one embodiment, high throughput screening methods involve providing a 
combinatorial library containing a large number of potential compounds (potential modulator 
compounds). Such "combinatorial chemical libraries" are then screened in one or more assays, as 
described herein, to identify those library members (particular chemical species or subclasses) 
that display a desired characteristic activity. The compounds thus identified can serve as target 
compounds. 

A combinatorial chemical library can be, e.g., a collection of diverse chemical 
compounds generated by chemical synthesis or biological synthesis. For example, a 
combinatorial chemical library such as a polypeptide library is formed by combining a set of 
chemical building blocks (e.g., in one example/amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound of a set length). 
Exemplary libraries include peptide libraries, nucleic acid libraries, antibody libraries (see, e.g., 
Vaughn et al. (1996) Nature Biotechnology. 14(3):309-314 and PCT/US96/10287), carbohydrate 
libraries (see, e.g., Liang et al. Science (1996) 274:1520-1522 and U.S. Patent 5,593,853), 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), and small organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&EN Jan 18, page 33 (1993); isoprenoids, U.S. 
Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent 5,549,974; pyrrolidines, U.S. 
Patents 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent 5,506,337) and the like. 

Preparation and screening of combinatorial or other libraries is well known to 
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. Res. 37:487-493 
(1991) and Houghton et al. Nature 354:84-88 (1991)). Other chemistries for generating chemical 
diversity libraries can also be used. 
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In addition, as noted, compound screening equipment for high-throughput 
screening is generally available, e.g., using any of a number of well known robotic systems that 
have also been developed for solution phase chemistries useful in assay systems. These systems 
include automated workstations including an automated synthesis apparatus and robotic systems 
utilizing robotic arms. Any of theabove devices are suitable for use with the present invention, 
e.g., for high-throughput screening of potential modulators. The nature and implementation of 
modifications to these devices (if any) so that they can operate as discussed herein will be 
apparent to persons skilled in the relevant art. 

Indeed, entire high throughput screening systems are commercially available. 
These systems typically automate entire procedures including all sample and reagent pipetting, 
liquid dispensing, timed incubations, and final readings of the microplate in detector(s) 
appropriate for the assay. These configurable systems provide high throughput and rapid start up 
as well as a high degree of flexibility and customization. Similarly, microfluidic implementations 
of screening are also commercially available. 

The manufacturers of such systems provide detailed protocols the various high 
throughput. Thus, for example, Zymark Corp. provides technical bulletins describing screening 
systems for detecting the modulation of gene transcription, ligand binding, and the like. The 
integrated systems herein, in addition to providing for sequence alignment and, optionally, 
synthesis of relevant nucleic acids, can include such screening apparatus to identify modulators 
that have an effect on one or more polynucleotides or polypeptides according to the present 
invention. 

In some assays it is desirable to have positive controls to ensure that the 
components of the assays are working properly. At least two types of positive controls are 
appropriate. That is, known transcriptional activators or inhibitors can be incubated with 
cells/plants/ etc. in one sample of the assay, and the resulting increase/decrease in transcription 
can be detected by measuring the resulting increase in RNA/ protein expression, etc., according to 
the methods herein. It will be appreciated that modulators can also be combined with 
transcriptional activators or inhibitors to find modulators which inhibit transcriptional activation 
or transcriptional repression. Either expression of the nucleic acids and proteins herein or any 
additional nucleic acids or proteins activated by the nucleic acids or proteins herein, or both, can 
be monitored. 

In an embodiment, the invention provides a method for identifying compositions 
that modulate the activity or expression of a polynucleotide or polypeptide of the invention. For 
example, a test compound, whether a small or large molecule, is placed in contact with a cell, 
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pknt (or plant tissue or explant), or composition comprising the polynucleotide or polypeptide of 
interest and a resulting effect on the cell, plant, (or tissue or explant) or composition is evaluated 
by monitoring, either directly or indirectly, one or more of: expression level of the polynucleotide 
or polypeptide, activity (or modulation of the activity) of the polynucleotide or polypeptide. In 
some cases, an alteration in a plant phenotype can be detected following contact of a plant (or 
plant cell, or tissue or explant) with the putative modulator, e.g., by modulation of expression or 
activity of a polynucleotide or polypeptide of the invention. 

SUBSEQUENCES 

Also contemplated are uses of polynucleotides, also referred to herein as 

oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably at least 

20, 30, or 50 bases, which hybridize under at least highly stringent (or ultra-high stringent or 

ultra-ultra- high stringent conditions) conditions to a polynucleotide sequence described above. 

The polynucleotides may be used as probes, primers, sense and antisense agents, and the like, 

according to methods as noted supra. 

Subsequences of the polynucleotides of the invention, including polynucleotide 
fragments and oligonucleotides are useful as nucleic acid probes and primers. An oligonucleotide 
suitable for use as a probe or primer is at least about 15 nucleotides in length, more often at least 
about 18 nucleotides, often at least about 21 nucleotides, frequently at least about 30 nucleotides, 
or about 40 nucleotides, or more in length. A nucleic acid probe is useful in hybridization 
protocols, e.g., to identify additional polypeptide homologues of the invention, including 
protocols for microarTay experiments. Primers can be annealed to a complementary target DNA 
strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA 
strand, and then extended along the target DNA strand by a DNA polymerase enzyme. Primer 
pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain 
reaction (PCR) or other nucleic-acid amplification methods. See Sambrook and Ausubel, supra. 

In addition, the invention includes an isolated or recombinant polypeptide 
including a subsequence of at least about 15 contiguous amino acids encoded by the recombinant 
or isolated polynucleotides of the invention. For example, such polypeptides, or domains or 
fragments thereof, can be used as immunogens, e.g., to produce antibodies specific for the 
polypeptide sequence, or as probes for detecting a sequence of interest. A subsequence can range 
in size from about 15 amino acids in length up to and including the full length of the polypeptide. 
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PRODUCTION OF TRANSGENIC PLANTS 
Modification of Traits 

The polynucleotides of the invention are favorably employed to produce 
transgenic plants with various traits, or characteristics, that have been modified in a desirable 
5 manner, e.g., to improve the seed characteristics of a plant. For example, alteration of expression 
levels or patterns (e.g., spatial or temporal expression patterns) of one or more of the transcription 
factors (or transcription factor homologues) of the invention, as compared with the levels of the 
same protein found in a wild type plant, can be used to modify a plant's traits. An illustrative 
example of trait modification, modified structure and development characteristics, by altering 
10 expression levels of a particular transcription factor is described further in the Examples and the 
Sequence Listing. 

Antisense and Cosuppression Approaches 

In addition to expression of the nucleic acids of the invention as gene 
replacement or plant phenotype modification nucleic acids, the nucleic acids are also useful for 

15 sense and anti-sense suppression of expression, e.g., to down-regulate expression of a nucleic 
acid of the invention, e.g., as a further mechanism for modulating plant phenotype. That is, the 
nucleic acids of the invention, or subsequences or anti-sense sequences thereof, can be used to 
block expression of naturally occurring homologous nucleic acids. A variety of sense and anti- 
sense technologies are known in the art, e.g., as set forth in Lichtenstein and Nellen (1997) 

20 Antisense Technology: A Practical Approach IRL Press at Oxford University, Oxford, England. 
In general, sense or anti-sense sequences are introduced into a cell, where they are optionally 
amplified, e.g., by transcription. Such sequences include both simple oligonucleotide sequences 
and catalytic sequences such as ribozymes. 

For example, a reduction or elimination of expression (i.e., a "knock-out") of a 

25 transcription factor or transcription factor homologue polypeptide in a transgenic plant, e.g., to 
modify a plant trait, can be obtained by introducing an antisense construct corresponding to the 
polypeptide of interest as a cDNA. For antisense suppression, the transcription factor or homologue 
cDNA is arranged in reverse orientation (with respect to the coding sequence) relative to the 
promoter sequence in the expression vector. The introduced sequence need not be the full length 

30 cDNA or gene, and need not be identical to the cDNA or gene found in the plant type to be 

transformed. Typically, the antisense sequence need only be capable of hybridizing to the target 
gene or RNA of interest. Thus, where the introduced sequence is of shorter length, a higher 
degree of homology to the endogenous transcription factor sequence will be needed for effective 
antisense suppression. While antisense sequences of various lengths can be utilized, preferably, 
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the introduced antisense sequence in the vector will be at least 30 nucleotides in length, and 
improved antisense suppression will typically be observed as the length of the antisense sequence 
increases. Preferably, the length of the antisense sequence in the vector will be greater than 100 
nucleotides. Transcription of an antisense construct as described results in the production of 
5 RNA molecules that are the reverse complement of mRNA molecules transcribed from the 
endogenous transcription factor gene in the plant cell. 

Suppression of endogenous transcription factor gene expression can also be 
achieved using a ribozyme. Ribozymes are RNA molecules that possess highly specific 
endoriboniiclease activity. The production and use of ribozymes are disclosed in U.S. Patent No. 

10 4,987,071 and U.S. Patent No. 5,543,508. Synthetic ribozyme sequences including antisense 
RNAs can be used to confer RNA cleaving activity on the antisense RNA, such that endogenous 
mRNA molecules that hybridize to the antisense RNA are cleaved, which in turn leads to an 
enhanced antisense inhibition of endogenous gene expression. 

Vectors in which RNA encoded by a transcription factor or transcription factor 

1 5 homologue cDNA is over-expressed can also be used to obtain co-suppression of a corresponding 
endogenous gene, e.g., in the manner described in U.S. Patent No. 5,23 1,020 to Jorgensen. Such 
co-suppression (also termed sense suppression) does not require that the entire transcription factor 
cDNA be introduced into the plant cells, nor does it require that the introduced sequence be 
exactly identical to the endogenous transcription factor gene of interest. However, as with 

20 antisense suppression, the suppressive efficiency will be enhanced as specificity of hybridization 
is increased, e.g., as the introduced sequence is lengthened, and/or as the sequence similarity 
between the introduced sequence and the endogenous transcription factor gene is increased. 

Vectors expressing an untranslatable form of the transcription factor mRNA, e.g., 
sequences comprising one or more stop codon, or nonsense mutation) can also be used to 

25 suppress expression of an endogenous transcription factor, thereby reducing or eliminating it's 
activity and modifying one or more traits. Methods for producing such constructs are described 
in U.S. Patent No. 5,583,021. Preferably, such constructs are made by introducing a premature 
stop codon into the transcription factor gene. Alternatively, a plant trait can be modified by gene 
silencing using double-strand RNA (Sharp (1999) Genes and Development 13: 139-141). 

30 Another method for abolishing the expression of a gene is by insertion 

mutagenesis using the T-DNA of Agrobacterium tumefaciens. After generating the insertion 
mutants, the mutants can be screened to identify those containing the insertion in a transcription 
factor or transcription factor homologue gene. Plants containing a single transgene insertion 
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event at the desired gene can be crossed to generate homozygous plants for the mutation (Koncz 
et al. (1992) Methods in Arabidopsis Research, World Scientific). 

Alternatively, a plant phenotype can be altered by eliminating an endogenous 
gene, such as a transcription factor or transcription factor homologue, e.g., by homologous 
recombination (Kempin et al. (1997) Nature 389:802). 

A plant trait can also be modified by using the cre-iox system (for example, as 
described in US Pat. No. 5,658,772). A plant genome can be modified to include first and 
second lox sites that are then contacted with a Cre recombinase. If the lox sites are in the same 
orientation, the intervening DNA sequence between the two sites is excised. If the lox sites are in 
the opposite orientation, the intervening sequence is inverted. 

The polynucleotides and polypeptides of this invention can also be expressed in a 
plant in the absence of an expression cassette by manipulating the activity or expression level of 
the endogenous gene by other means. For example, by ectopically expressing a gene by T-DNA 
activation tagging (Ichikawa et al. (1997) Nature 390 698-701; Kakimoto et al. (1996) Science 
274: 982-985). This method entails transforming a plant with a gene tag containing multiple 
transcriptional enhancers and once the tag has inserted into the genome, expression of a flanking 
gene coding sequence becomes deregulated. In another example, the transcriptional machinery in 
a plant can be modified so as to increase transcription levels of a polynucleotide of the invention 
(See, e.g., PCT Publications WO 96/06166 and WO 98/53057 which describe the modification of 
the DNA binding specificity of zinc finger proteins by changing particular amino acids in the 
DNA binding motif). 

The transgenic plant can also include the machinery necessary for expressing or 
altering the activity of a polypeptide encoded by an endogenous gene, for example by altering the 
phosphorylation state of the polypeptide to maintain it in an activated state. 

Transgenic plants (or plant cells, or plant explants, or plant tissues) incorporating 
the polynucleotides of the invention and/or expressing the polypeptides of the invention can be 
produced by a variety of well established techniques as described above. Following construction 
of a vector, most typically an expression cassette, including a polynucleotide, e.g., encoding a 
transcription factor or transcription factor homologue, of the invention, standard techniques can 
be used to introduce the polynucleotide into a plant, a plant cell, a plant explant or a plant tissue 
of interest. Optionally, the plant cell, explant or tissue can be regenerated to produce a transgenic 
plant. 

The plant can be any higher plant, including gymnosperms, monocotyledonous 
and dicotyledenous plants. Suitable protocols are available for Leguminosae (alfalfa, soybean, 
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clover, etc.), Umbelliferae (carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, 
broccoli, etc.), Curcurbitaceae (melons and cucumber), Gramineae (wheat, corn, rice, barley, 
millet, etc.), Solanaceae (potato, tomato, tobacco, peppers, etc.), and various other crops. See 
protocols described in Ammirato et al. (1984) Handbook of Plant Cell Culture -Crop Species . 
Macmillan Publ. Co. Shimamoto et al. (1989) Nature 338:274-276: Fromm et al. (1990) 
Bio/Technoloev 8:833-839; and Vasil et al. (1990) Bio/Technology 8:429-434. 

Transformation and regeneration of both monocotyledonous and dicotyledonous 
plant cells is now routine, and the selection of the most appropriate transformation technique will 
be determined by the practitioner. The choice of method will vary with the type of plant to be 
transformed; those skilled in the art will recognize the suitability of particular methods for given 
plant types. Suitable methods can include, but are not limited to: electroporation of plant 
protoplasts; liposome-mediated transformation; polyethylene glycol (PEG) mediated 
transformation; transformation using viruses; micro-injection of plant cells; micro-projectile 
bombardment of plant cells; vacuum infiltration; and Agrobacterium tumeficiens mediated 
transformation. Transformation means introducing a nucleotide sequence in a plant in a manner to 
cause stable or transient expression of the sequence. 

Successful examples of the modification of plant characteristics by 
transformation with cloned sequences which serve to illustrate the current knowledge in this field 
of technology, and which are herein incorporated by reference, include: U.S. Patent Nos. 
5,571,706; 5,677,175; 5,510,471; 5,750,386; 5,597,945; 5,589,615; 5,750,871; 5,268,526; 
5,780,708; 5,538,880; 5,773,269; 5,736,369 and 5,610,042. 

Following transformation, plants are preferably selected using a dominant 
selectable marker incorporated into the transformation vector. Typically, such a marker will 
confer antibiotic or herbicide resistance on the transformed plants, and selection of transformants 
can be accomplished by exposing the plants to appropriate concentrations of the antibiotic or 
herbicide. 

After transformed plants are selected and grown to maturity, those plants 
showing a modified trait are identified. The modified trait can be any of those traits described 
above. Additionally, to confirm that the modified trait is due to changes in expression levels or 
activity of the polypeptide or polynucleotide of the invention can be determined by analyzing 
mRNA expression using Northern blots, RT-PCR or microarrays, or protein expression using 
immunoblots or Western blots or gel shift assays. 
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INTEGRATED SYSTEMS— SEQUENCE IDENTITY 

Additionally, the present invention may be an integrated system, computer or 
computer readable medium that comprises an instruction set for determining the identity of one or 
more sequences in a database. In addition, the instruction set can be used to generate or identify 
5 sequences that meet any specified criteria. Furthermore, the instruction set may be used to 
associate or link certain functional benefits, such modified structure and development 
characteristics, with one or more identified sequence. 

For example, the instruction set can include, e.g., a sequence comparison or other 
alignment program, e.g., an available program such as, for example, the Wisconsin Package 

1 0 Version 10.0, such as BLAST, FASTA, PILEUP, FINDPATTERNS or the like (GCG, Madision, 
WI). Public sequence databases such as GenBank, EMBL, Swiss-Prot and PIR or private 
sequence databases such as PhytoSeq (Incyte Pharmaceuticals, Palo Alto, CA) can be searched. 

Alignment of sequences for comparison can be conducted by the local homology 
algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, by the homology alignment 

15 algorithm of Needleman and Wunsch (1970) J. MoL Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A . 85: 2444, by computerized 
implementations of these algorithms. After alignment, sequence comparisons between two (or 
more) polynucleotides or polypeptides are typically performed by comparing sequences of the 
two sequences over a comparison window to identify and compare local regions of sequence 

20 similarity. The comparison window can be a segment of at least about 20 contiguous positions, 
usually about 50 to about 200, more usually about 100 to about 150 contiguous positions. A 
description of the method is provided in Ausubel et ah, supra. 

A variety of methods of determining sequence relationships can be used, 
including manual alignment and computer assisted sequence alignment and analysis. This later 

25 approach is a preferred approach in the present invention, due to the increased throughput 
afforded by computer assisted methods. As noted above, a variety of computer programs for 
performing sequence alignment are available, or can be produced by one of skill. 

One example algorithm that is suitable for determining percent sequence identity 
and sequence similarity is the BLAST algorithm, which is described in Altschul et al. J. Mol. Biol 

30 215:403-410 (1990). Software for performing BLAST analyses is publicly available, e.g., 

through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
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referred to as the neighborhood word score threshold (Altschul et al, supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. 
The word hits are then extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, for 
5 nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 
0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a 
scoring matrix is used to calculate the cumulative score. Extension of the word hits in each 
direction are halted when: the cumulative alignment score falls off by the quantity X from its 
maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of 

10 one or more negative-scoring residue alignments; or the end of either sequence is reached. The 
BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. 
The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an 
expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino 
acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) 

15 of 10, and the BLOSUM62 scoring matrix {see Henikoff & Henikoff H989 ) Proc. Natl. Acad. 
SdJLJSA 89:10915). 

In addition to calculating percent sequence identity, the BLAST algorithm also 
performs a statistical analysis of the similarity between two sequences (see, e.g.,.Karlin & 
Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787). One measure of similarity provided 

20 by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of 
the probability by which a match between two nucleotide or amino acid sequences would occur 
by chance. For example, a nucleic acid is considered similar to a reference sequence (and, 
therefore, in this context, homologous) if the smallest sum probability in a comparison of the test 
nucleic acid to the reference nucleic acid is less than about 0.1, or less than about 0.01, and or 

25 even less than about 0.001 . An additional example of a useful sequence alignment algorithm is 
P1LEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using 
progressive, pairwise alignments. The program can align, e.g., up to 300 sequences of a 
maximum length of 5,000 letters. 

The integrated system, or computer typically includes a user input interface 

30 allowing a user to selectively view one or more sequence records corresponding to the one or 

more character strings, as well as an instruction set which aligns the one or more character strings 
with each other or with an additional character string to identify one or more region of sequence 
similarity. The system may include a link of one or more character strings with a particular 



32 



WO 01/36444 



PCT/US00/31325 



phenotype or gene function.' Typically, the system includes a user readable output element which 
displays an alignment produced by the alignment instruction set. 

The methods of this invention can be implemented in a localized or distributed 
computing environment. In a distributed environment, the methods may implemented on a single 
5 computer comprising multiple processors or on a multiplicity of computers. The computers can 
be linked, e.g. through a common bus, but more preferably the computer(s) are nodes on a 
network. The network can be a generalized or a dedicated local or wide-area network and, in 
certain preferred embodiments, the computers may be components of an intra-net or an internet. 

Thus, the invention provides methods for identifying a sequence similar or 

10 homologous to one or more polynucleotides as noted herein, or one or more target polypeptides 
encoded by the polynucleotides, or otherwise noted herein and may include linking or associating 
a given plant phenotype or gene function with a sequence. In the methods, a sequence database is 
provided (locally or across an inter or intra net) and a query is made against the sequence 
database using the relevant sequences herein and associated plant phenotypes or gene functions. 

1 5 Any sequence herein can be entered into the database, before or after querying 

the database. This provides for both expansion of the database and, if done before the querying 
step, for insertion of control sequences into the database. The control sequences can be detected 
by the query to ensure the general integrity of both the database and the query. As noted, the 
query can be performed using a web browser based interface. For example, the database can be a 

20 centralized public database such as those noted herein, and the querying can be done from a 
remote terminal or computer across an internet or intranet. 

EXAMPLES 

The following examples are intended to illustrate but not limit the present 

invention. 

25 EXAMPLE I. FULL LENGTH GENE IDENTIFICATION AND CLONING 

Putative transcription factor sequences (genomic or ESTs) related to known 
transcription factors were identified in the Arabidopsis thaliana GenBank database using the 
tblastn sequence analysis program using default parameters and a P-value cutoff threshold of -4 
or -5 or lower, depending on the length of the query sequence. Putative transcription factor 

30 sequence hits were then screened to identify those containing particular sequence strings. If the 
sequence hits contained such sequence strings, the sequences were confirmed as transcription 
factors. 



33 



WO 01/36444 PCT/USOO/31325 

Alternatively, Arabidopsis thaliana cDNA libraries derived from different tissues 
or treatments, or genomic libraries were screened to identify novel members of a transcription 
family using a low stringency hybridization approach. Probes were synthesized using gene 
specific primers in a standard PCR reaction (annealing temperature 60° C) and labeled with 32 P 
5 dCTP using the High Prime DNA Labeling Kit (Boehringer Mannheim). Purified radiolabeled 
probes were added to filters immersed in Church hybridization medium (0.5 M NaP0 4 pH 7.0, 
7% SDS, 1 % w/v bovine serum albumin) and hybridized overnight at 60 °C with shaking. Filters 
were washed two times for 45 to 60 minutes with lxSCC, 1% SDS at 60° C. 

To identify additional sequence 5 f or 3* of a partial cDNA sequence in a cDNA 

1 0 library, 5' and 3' rapid amplification of cDNA ends (RACE) was performed using the Marathon™ 
cDNA amplification kit (Clontech, Palo Alto, CA). Generally, the method entailed first isolating 
poly(A) mRNA, performing first and second strand cDNA synthesis to generate double stranded 
cDNA, blunting cDNA ends, followed by ligation of the Marathon™ Adaptor to the cDNA to 
form a library of adaptor-ligated ds cDNA. 

1 5 Gene-specific primers were designed to be used along with adaptor specific 

primers for both 5' and 3' RACE reactions. Nested primers, rather than single primers, were used 
to increase PCR specificity. Using 5' and 3* RACE reactions, 5* and 3' RACE fragments were 
obtained, sequenced and cloned. The process can be repeated until 5' and 3' ends of the full- 
length gene were identified. Then the full-length cDNA was generated by PCR using primers 

20 specific to 5' and 3' ends of the gene by end-to-end PCR. 

EXAMPLE II CONSTRUCTION OF EXPRESSION VECTORS 

The sequence was amplified from a genomic or cDNA library using primers 
specific to sequences upstream and downstream of the coding region. The expression vector was 
pMEN20 or pMEN65, which are both derived from pMON3 16 (Sanders et al, (1987 ) Nucleic 

25 Acids Research 15:1543-58) and contain the CaMV 35S promoter to express transgenes. To 

clone the sequence into the vector, both pMEN20 and the amplified DNA fragment were digested 
separately with Sail and NotI restriction enzymes at 37° C for 2 hours. The digestion products 
were subject to electrophoresis in a 0.8% agarose gel and visualized by ethidium bromide 
staining. The DNA fragments containing the sequence and the linearized plasmid were excised 

30 and purified by using a Qiaquick gel extraction kit (Qiagen, CA). The fragments of interest were 
ligated at a ratio of 3 : 1 (vector to insert). Ligation reactions using T4 DNA ligase (New England 
Biolabs, MA) were carried out at 16° C for 16 hours. The ligated DNAs were transformed into 
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competent cells of the E. coli strain DHSalpha by using the heat shock method. The 
transformations were plated on LB plates containing 50 mgA kanamycin (Sigma). 

Individual colonies were grown overnight in five milliliters of LB broth 
containing 50 mg/1 kanamycin at 37° C. Plasmid DNA was purified by using Qiaquick Mini 
5 Prep kits (Qiagen, CA). 

EXAMPLE III TRANSFORMATION OF A GROBA CTERIUM WITH THE EXPRESSION 
VECTOR 

After the plasmid vector containing the gene was constructed, the vector was 
used to transform Agrobacterium tumefaciens cells expressing the gene products. The stock of 

10 Agrobacterium tumefaciens cells for transformation were made as described by Nagel et al. 

(1990) FEMS Microbiol Letts . 67: 325-328. Agrobacterium strain ABI was grown in 250 ml LB 
medium (Sigma) overnight at 28°C with shaking until an absorbance (A$oo) of 0.5 - 1 .0 was 
reached. Cells were harvested by centrifugation at 4,000 x g for 15 min at 4°C. Cells were then 
resuspended in 250 \i\ chilled buffer (1 mM HEPES, pH adjusted to 7.0 with KOH). Cells were 

15 centrifuged again as described above and resuspended in 125 \il chilled buffer. Cells were then 
centrifuged and resuspended two more times in the same HEPES buffer as described above at a 
volume of 100 \il and 750 }il, respectively. Resuspended cells were then distributed into 40 |il 
aliquots, quickly frozen in liquid nitrogen, and stored at -80° C. 

Agrobacterium cells were transformed with plasmids prepared as described 

20 above following the protocol described by Nagel et al. For each DNA construct to be 

transformed, 50 - 100 ng DNA (generally resuspended in 10 mM Tris-HCl, I mM EDTA, pH 
8.0) was mixed with 40 jil of Agrobacterium cells. The DNA/cell mixture was then transferred to 
a chilled cuvette with a 2mm electrode gap and subject to a 2.5 kV charge dissipated at 25 ^F and 
200 nF using a Gene Pulser II apparatus (Bio-Rad). After electroporation, cells were 

25 immediately resuspended in 1 .0 ml LB and allowed to recover without antibiotic selection for 2 - 
4 hours at 28° C in a shaking incubator. After recovery, cells were plated onto selective medium 
of LB broth containing 100 ng/ml spectinomycin (Sigma) and incubated for 24-48 hours at 28° C. 
Single colonies were then picked and inoculated in fresh medium. The presence of the plasmid 
construct was verified by PCR amplification and sequence analysis. 

30 EXAMPLE IV. TRANSFORMATION OF ARABIDOPSIS PLANTS WITH AGROBACTERIUM 
TUMEFACIENS WITH EXPRESSION VECTOR 

After transformation of Agrobacterium tumefaciens with plasmid vectors 

containing the gene, single Agrobacterium colonies were identified, propagated, and used to 
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transform Arabidopsis plants. Briefly, 500 ml cultures of LB medium containing 50 mg/1 
kanamycin were inoculated with the colonies and grown at 28° C with shaking for 2 days until an 
absorbance (A$o 0 ) of > 2.0 is reached. Cells were then harvested by centrifugation at 4,000 x g 
for 10 min, and resuspended in infiltration medium (1/2 X Murashige and Skoog salts (Sigma), 1 
5 X Gamborg's B-5 vitamins (Sigma), 5.0% (w/v) sucrose (Sigma), 0.044 \iM benzylamino purine 
(Sigma), 200 \xVL Silwet L-77 (Lehle Seeds) until an absorbance (A 6 oo) of 0.8 was reached. 

Prior to transformation, Arabidopsis thaliana seeds (ecotype Columbia) were 
sown at a density of -10 plants per 4" pot onto Pro-Mix BX potting medium (Hummert 
International) covered with fiberglass mesh (18 mm X 16 mm). Plants were grown under 

10 continuous illumination (50-75 ^E/m 2 /sec) at 22-23° C with 65-70% relative humidity. After 
about 4 weeks, primary inflorescence stems (bolts) are cut off to encourage growth of multiple 
secondary bolts. After flowering of the mature secondary bolts, plants were prepared for 
transformation by removal of all siliques and opened flowers. 

The pots were then immersed upside down in the mixture of Agrobacterium 

1 5 infiltration medium as described above for 30 sec, and placed on their sides to allow draining into 
a 1* x 2* flat surface covered with plastic wrap. After 24 h, the plastic wrap was removed and 
pots are turned upright. The immersion procedure was repeated one week later, for a total of two 
immersions per pot. Seeds were then collected from each transformation pot and analyzed 
following the protocol described below. 

20 EXAMPLE V. IDENTIFICATION OF ARABIDOPSIS PRIMARY TRANSFORMANTS 

Seeds collected from the transformation pots were sterilized essentially as 
follows. Seeds were dispersed into in a solution containing 0. 1% (v/v) Triton X-100 (Sigma) and 
sterile H 2 0 and washed by shaking the suspension for 20 min. The wash solution was then 
drained and replaced with fresh wash solution to wash the seeds for 20 min with shaking. After 

25 removal of the second wash solution, a solution containing 0.1% (v/v) Triton X-100 and 70% 
ethanol (Equistar) was added to the seeds and the suspension was shaken for 5 min. After 
removal of the ethanol/detergent solution, a solution containing 0.1% (v/v) Triton X-100 and 30% 
(v/v) bleach (Clorox) was added to the seeds, and the suspension was shaken for 10 min. After 
removal of the bleach/detergent solution, seeds were then washed five times in sterile distilled 

30 H 2 0. The seeds were stored in the last wash water at 4° C for 2 days in the dark before being 

plated onto antibiotic selection medium (1 X Murashige and Skoog salts (pH adjusted to 5.7 with 
1M KOH), 1 X Gamborg's B-5 vitamins, 0.9% phytagar (Life Technologies), and 50 mg/1 
kanamycin). Seeds were germinated under continuous illumination (50-75 ^E/m 2 /sec) at 22-23° 
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C. After 7-10 days of growth under these conditions, kanamycin resistant primary transformants 
(Ti generation) were visible and obtained. These seedlings were transferred first to fresh 
selection plates where the seedlings continued to grow for 3-5 more days, and then to soil (Pro- 
Mix BX potting medium). 
5 Primary transformants were crossed and progeny seeds (T 2 ) collected; kanamycin 

resistant seedlings were selected and analyzed. The expression levels of the recombinant 
polynucleotides in the transformants varies from about a 5% expression level increase to a least a 
100% expression level increase. Similar observations are made with respect to polypeptide level 
expression. 

10 

EXAMPLE VI. IDENTIFICATION OF ARABIDOPSIS PLANTS WITH TRANSCRIPTION 
FACTOR GENE KNOCKOUTS 

The screening of insertion mutagenized Arabidopsis collections for null mutants 

in a known target gene was essentially as described in Krysan et al (1999) Plant Cell 1 1:2283- 

1 5 2290. Briefly, gene-specific primers, nested by 5-250 pb to each others, were designed from the 
5' and 3' regions of a known target gene. Similarly, nested sets of primers were also created 
specific to each of the T-DNA or transposon ends (the "right" and "left" borders). All possible 
combinations of gene specific and T-DNA/transposon primers were used to detect by PCR an 
insertion event within or close to the target gene. The amplified DNA fragments were then 

20 sequenced which allows the precise determination of the T-DNA/transposon insertion point 
relative to the target gene. Insertion events within the coding or intervening sequence of the 
genes were deconvoiuted from a pool comprising a plurality of insertion events to a single unique 
mutant plant for functional characterization. The method is described in more detail in Yu and 
Adam, US Application Serial No. 09/177,733 filed October 23, 1998. 

25 EXAMPLE VP. IDENTIFICATION OF STRUCTURE AND DEVELOPMENT 

CHARACTERISTICS PHENOTYPE IN OVEREXPRESSOR OR GENE KNOCKOUT 
PLANTS 

Experiments were performed to identify those transformants or knockouts that 
exhibited a modified structure and development characteristics. For such studies, the 
30 transformants were observed by eye to identify novel structural or developmental characteristics 
associated with the ectopic expression of the polynucleotides or polypeptides of the invention. 

Table 3 shows the phenotypes observed for particular overexpressor or knockout 
plants and provides the SEQ ID No., the internal reference code (GID), whether a knockout or 
overexpressor plant was analyzed and the observed phenotype. 
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Table 3 



SEQ ID No. 


GID 


Knockout (KO) or 
overexpressor (KO) 


Phenotype observed 


1 


G727 


OE 


Plants were small, and more dark green in 
color, late flowering and poorly fertile. 


3 


G732 


OE 


Plants were small and inflorescence was 
unelongated. Flowers parts appeared to be ton- 
elongated and the plants were semi-sterile. 


5 


G9 


OE 


Increased root mass 


7 


G428 


OE 


Lobed and highly serrated leaves and 
abnormal first and second whorl floral organs 


9 


G869 


OE 


Undeveloped or small anthers 


11 


G1269 


OE 


Extended petioles and leaves pointed upwards 


13 


G1038 


OE 


Altered leaf shape 


15 


G438 


KO 


Reduced lignin in stem 


17 


G571 


KO 


Delayed senescence at the end of the plant 
lifecycle 


19 


G748 


OE 


More vascular bundles in stem 


21 


G431 


OE 


Severe developmental abnormalities such as 

flltwpH rYTflnrViin ex t^x/ictpH rncpfff* 1<*;ivpq 
aiititu xji aiit/iiiiig , LWisicu lUovLlv lVyaV&O) 

flowers with missing pistils, fused stamens and 
atvnical numbers of netals and stamens 
reduced secondary bolts, and lack of cauline 
leaves. 


23 


G187 


OE 


Plants had long, thin cotyledons and reduced 
anical dominance Several flower 
abnormalities, including underdeveloped, 
sepaloid petals and underdeveloped anthers 
were also observed. 


25 


G470 


OE 


Plants were sterile due to failure of anthers to 
elongate 


27 


G615 


OE 


Plants were sterile due to failure of anthers to 
develop and failure of stamens to elongate. 
Fused cotyledons and absence of a shoot apical 
meristem and true leaves was also observed. 


29 


G1073 


OE 


Increased plant size and serrated leaves 



For a particular overexpressor that shows a less beneficial structure and development 
characteristic, it may be more useful to select a plant with a decreased expression of the particular 
transcription factor. For a particular knockout that shows a less beneficial structure and 
development characteristic, it may be more useful to select a plant with an increased expression 
of the particular transcription factor. 
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EXAMPLE VHI. IDENTIFICATION OF HOMOLOGOUS SEQUENCES 

Homologous sequences from Arabidopsis and plant species other than Arabidopsis were 
identified using database sequence search tools, such as the Basic Local Alignment Search Tool 
(BLAST) (Altschul et al. (1990) J. Mol. Biol. 215:403^410; and Altschul et al. (1997) Nucl. Acid 
5 Res. 25: 3389-3402). The tblastx sequence analysis programs were employed using the 

BLOSUM-62 scoring matrix (Henikoff, S. and Henikoff, J. G. (1992) Proc. Natl. Acad. Sci. USA 
89: 10915-10919). 

Identified Arabidopsis homologous sequences are provided in Figure 2 and included in 
the Sequence Listing. The percent sequence identity among these sequences is as low as 47% 

10 sequence identity. Additionally, the entire NCBI GenBank database was filtered for sequences 
from all plants except Arabidopsis thaliana by selecting all entries in the NCBI GenBank 
database associated with NCBI taxonomic ED 33090 (Viridiplantae; all plants) and excluding 
entries associated with taxonomic ID 3701 {Arabidopsis thaliana). These sequences were 
compared to sequences representing genes of SEQ IDs Nos. 1-54 on 9/26/2000 using the 

15 Washington University TBLASTX algorithm (version 2.0al9MP). For each gene of SEQ IDs 
Nos. 1-54, individual comparisons were ordered by probability score (P- value), where the score 
reflects the probability that a particular alignment occurred by chance. For example, a score of 
3.6e-40 is 3.6 x 10" 40 . For up to ten species, the gene with the lowest P-value (and therefore the 
most likely homolog) is listed in Figure 3. 

20 In addition to P-values, comparisons were also scored by percentage identity. Percentage 

identity reflects the degree to which two segments of DNA or protein are identical over a 
particular length. The ranges of percent identity between the non-Arabidopsis genes shown in 
Figure 3 and the Arabidopsis genes in the sequence listing are: SEQ ED No. 1: 36%-69%; SEQ ID 
No. 3: 46%-54%; SEQ ID No. 5: 57%-72%; SEQ ID No. 7: 54%-69%; SEQ ID No. 9: 3 l%-68%; 

25 SEQ ID No. 1 1 : 47%-90%; SEQ ID No. 1 3: 34%-82%; SEQ ID No. 1 5: 49%-88%; SEQ ID No. 
17: 56%-67%; SEQ ID No. 19: 39%-61%; SEQ ID No. 21: 61%-87%; SEQ ID No. 23: 38%- 
85%; SEQ ID No. 25: 44%-94%; SEQ ID No. 27: 35%^4%; SEQ ID No. 29: 37%-71%; SEQ ID 
No. 31: 38%-77%; SEQ ID No. 33: 57%-69%; SEQ ID No. 35: 54%-69%; SEQ ID No. 37: 60%- 
75%; SEQ ID No. 39: 47%-65%; SEQ ID No. 41: 60%-88%; SEQ ID No. 43: 43%-87%; and 

30 SEQ ID No. 45: 53%-97%. 

The polynucleotides and polypeptides in the Sequence Listing and the identified 
homologous sequences may be stored in a computer system and have associated or linked with 
the sequences a function, such as that the polynucleotides and polypeptides are useful for 
modifying the structure and development characteristics of a plant. 
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All references, publications, patents and other documents herein are incorporated by 
reference in their entirety for all purposes. Although the invention has been described with 
reference to the embodiments and examples above, it should be understood that various 
modifications can be made without departing from the spirit of the invention. 
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What is claimed is: 

1 . A transgenic plant with modified structure and development characteristics, which plant 
comprises a recombinant polynucleotide comprising a nucleotide sequence selected from the 
group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising a sequence selected from 
SEQ ID Nos. 2N, where N=l-23, or a complementary nucleotide sequence thereof; 

(b) a nucleotide sequence encoding a polypeptide comprising a conservatively substituted 
variant of a polypeptide of (a); 

(c) a nucleotide sequence comprising a sequence selected from those of SEQ ID Nos. 2N- 
1, where N=l-23, or a complementary nucleotide sequence thereof; 

(d) a nucleotide sequence comprising silent substitutions in a nucleotide sequence of (c); 

(e) a nucleotide sequence which hybridizes under stringent conditions to a nucleotide 
sequence of one or more of: (a), (b), (c), or (d); 

(f) a nucleotide sequence comprising at least 15 consecutive nucleotides of a sequence of 
any of (a)-(e); 

(g) a nucleotide sequence comprising a subsequence or fragment of any of (a)-(f). which 
subsequence or fragment encodes a polypeptide that modifies a plant's structure and 
development characteristics; 

(h) a nucleotide sequence having at least 31% sequence identity to a nucleotide sequence 
of any of (a)-(g); 

(i) a nucleotide sequence having at least 60% identity sequence identity to a nucleotide 
sequence of any of (a)-(g); 

(j) a nucleotide sequence which encodes a polypeptide having at least 31% identity 

sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; 

(k) a nucleotide sequence which encodes a polypeptide having at least 60% identity 

sequence identity to a polypeptide of SEQ ED Nos. 2N, where N=l-23; and 

(1) a nucleotide sequence which encodes a polypeptide having at least 65% sequence 

identity to a conserved domain of a polypeptide of SEQ ID Nos. 2N, where N=l-23. 

2. The transgenic plant of claim 1 , further comprising a constitutive, inducible, or tissue- 
active promoter operably linked to said nucleotide sequence. 

3. The transgenic plant of claim 1 , wherein the plant is selected from the group consisting 
of: soybean, wheat, corn, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
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banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, 
cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous fruits, and 
vegetable brassicas. 

5 

4. An isolated or recombinant polynucleotide comprising a nucleotide sequence selected 

from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising a sequence selected from 
SEQ ID Nos. 2N, where N=l-23, or a complementary nucleotide sequence thereof; 
10 (b) a nucleotide sequence encoding a polypeptide comprising a conservatively substituted 

variant of a polypeptide of (a); 

(c) a nucleotide sequence comprising a sequence selected from those of SEQ ID Nos. 2N- 
1, where N=l-23, or a complementary nucleotide sequence thereof; 

(d) a nucleotide sequence comprising silent substitutions in a nucleotide sequence of (c); 
1 5 (e) a nucleotide sequence which hybridizes under stringent conditions to a nucleotide 

sequence of one or more of: (a), (b), (c), or (d); 

(f) a nucleotide sequence comprising at least 15 consecutive nucleotides of a sequence of 
any of (a)-(e); 

(g) a nucleotide sequence comprising a subsequence or fragment of any of (a)-(f), which 
20 subsequence or fragment encodes a polypeptide that modifies a plant's structure and 

development characteristics; 

(h) a nucleotide sequence having at least 31% sequence identity to a nucleotide sequence 
ofanyof(a)-(g); 

(i) a nucleotide sequence having at least 60% identity sequence identity to a nucleotide 
25 sequence of any of (a)-(g); 

(j) a nucleotide sequence which encodes a polypeptide having at least 3 1% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; 
(k) a nucleotide sequence which encodes a polypeptide having at least 60% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; and 
30 (1) a nucleotide sequence which encodes a conserved domain of a polypeptide having at 

least 65% sequence identity to a conserved domain of a polypeptide of SEQ ID Nos. 2N, 
where N=l-23. 
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5. The isolated or recombinant polynucleotide of claim 4, further comprising a constitutive, 
inducible, or tissue-active promoter operably linked to the nucleotide sequence. 

6. A cloning or expression vector comprising the isolated or recombinant polynucleotide of 
claim 4. 

7. . A cell comprising the cloning or expression vector of claim 6. 

8. A transgenic plant comprising the isolated or recombinant polynucleotide of claim 4. 

9. A composition produced by one or more of: 

(a) incubating one or more polynucleotide of claim 4 with a nuclease; 

(b) incubating one or more polynucleotide of claim 4 with a restriction enzyme; 

(c) incubating one or more polynucleotide of claim 4 with a polymerase; 

(d) incubating one or more polynucleotide of claim 4 with a polymerase and a primer; 

(e) incubating one or more polynucleotide of claim 4 with a cloning vector, or 

(f) incubating one or more polynucleotide of claim 4 with a cell. 

10. A composition comprising two or more different polynucleotides of claim 4. 

11. An isolated or recombinant polypeptide comprising a subsequence of at least about 15 
contiguous amino acids encoded by the recombinant or isolated polynucleotide of claim 4. 

12. A plant ectopically expressing an isolated polypeptide of claim 1 1 . 

13. A method for producing a plant having a modified structure and development 
characteristic, the method comprising altering the expression of the isolated or recombinant 
polynucleotide of claim 4 or the expression levels or activity of a polypeptide of claim 1 1 in a 
plant, thereby producing a modified plant, and selecting the modified plant for modified structure 
and development characteristics thereby providing the modified plant with a modified structure 
and development characteristics. 

14. The method of claim 13, wherein the polynucleotide is a polynucleotide of claim 4. 
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15. A method of identifying a factor that is modulated hy or interacts with a polypeptide 
encoded by a polynucleotide of claim 4, the method comprising: 

(a) expressing a polypeptide encoded by the polynucleotide in a plant; and 

(b) identifying at least one factor that is modulated by or interacts with the polypeptide. 

16. The method of claim 15, wherein the identifying is performed by detecting binding by the 
polypeptide to a promoter sequence, or detecting interactions between an additional protein and 
the polypeptide in a yeast two hybrid system. 

17. The method of claim 15, wherein the identifying is performed by detecting expression of 
a factor by hybridization to a microarray, subtractive hybridization or differential display. 

18. A method of identifying a molecule that modulates activity or expression of a 
polynucleotide or polypeptide of interest, the method comprising: 

(a) placing the molecule in contact with a plant comprising the polynucleotide or 
polypeptide encoded by the polynucleotide of claim 4; and, 

(b) monitoring one or more of: 

(i) expression level of the polynucleotide in the plant; 

(ii) expression level of the polypeptide in the plant; 

(iii) modulation of an activity of the polypeptide in the plant; or 

(iv) modulation of an activity of the polynucleotide in the plant. 

19. An integrated system, computer or computer readable medium comprising one or more 
character strings corresponding to a polynucleotide of claim 4, or to a polypeptide encoded by the 
polynucleotide. 

20. The integrated system, computer or computer readable medium of claim 19, further 
comprising a link between said one or more sequence strings to a modified plant structure and 
development characteristics phenotype. 

21 . A method of identifying a sequence similar or homologous to one or more 
polynucleotides of claim 4, or one or more polypeptides encoded by the polynucleotides, the 
method comprising: 

(a) providing a sequence database; and, 
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(b) querying the sequence database with one or more target sequences corresponding to 
the one or more polynucleotides or to the one or more polypeptides to identify one or 
more sequence members of the database that display sequence similarity or homology to 
one or more of the one or more target sequences. 

22. The method of claim 21, wherein the querying comprises aligning one or more of the 
target sequences with one or more of the one or more sequence members in the sequence 
database. 

23. The method of claim 21, wherein the querying comprises identifying one or more of the 
one or more sequence members of the database that meet a user-selected identity criteria with one 
or more of the target sequences. 

24. The method of claim 21, further comprising linking the one or more of the 
polynucleotides of claim 4, or encoded polypeptides, to a modified plant structure and 
development characteristics phenotype. 

25. A plant comprising altered expression levels of an isolated or recombinant polynucleotide 
of claim 4. 

26. A plant comprising altered expression levels or the activity of an isolated or recombinant 
polypeptide of claim 1 1 . 

27. A plant lacking a nucleotide sequence encoding a polypeptide of claim 1 1 . 
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Figure 1 



SEQ ID No. 


GID 


cDNA or protein 


conserved domain 


1 


G727 


cDNA 




2 


G727 


protein 


226-269 


3 


G732 


cDNA 




4 


G732 


protein 


31-9 


5 


G9 


cDNA 




6 


G9 


protein 


62-127 


7 


G428 


cDNA 




8 


G428 


protein 


229-292 


9 


G869 


cDNA 




10 


G869 


protein 


109-177 


11 


G1269 


cDNA 




12 


G1269 


protein 


27-83 


13 


G1038 


cDNA 




14 


G1038 


protein 


198-247 


15 


G438 


cDNA 




16 


G438 


protein 


22-85 


17 


G571 


cDNA 




18 


G571 


protein 


160-220 


19 


G748 


cDNA 




20 


G748 


protein 


112-140 


21 


G431 


cDNA 




22 


G431 


protein 


286-335 


23 


G187 


cDNA 




24 


G187 


protein 


172-228 


25 


G470 


cDNA 




26 


G470 


protein 


61-393 


27 


G615 


cDNA 




28 


G615 


protein 


88-147 


29 


G1073 


cDNA 




30 


G1073 


protein 


33^2, 78-175 
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Figure 2 



SEQ ID No. 


GID 


homolog 


cDNA or protein 


conserved domain 


31 


G1493 


homolog of G727 


cDNA 




32 


G1493 


homolog of G727 


protein 


242-289 


33 


G993 


homolog of G9 


cDNA 




34 


G993 


homolog of G9 


protein 


69-134 


35 


G867 


homolog of G9 


cDNA 




36 


G867 


homolog of G9 


protein 


59-124 


37 


G1930 


homolog of G9 


cDNA 




38 


G1930 


homolog of G9 


protein 


59-124 


39 


G1594 


homolog of G428 


cDNA 




40 


G1594 


homolog of G428 


protein 


262-325 


41 


G391 


homolog of G438 


cDNA 




42 


G391 


homolog of G438 


protein 


25-85 


43 


G390 


homolog of G438 


cDNA 




' 44 


G390 


homolog of G438 


protein 


18-81 


45 


G1548 


homolog of G438 


cDNA 




46 


G1548 


homolog of G438 


protein 


17-77 
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Figure 3A 
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Figure 3B 
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Figure 3C 
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Figure 3D 
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35 
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7722547 


1.30E-33 


Lotus japonicus 


35 


G867 
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3.90E-32 


Lycopersicon pennellii 


35 


G867 


8749037 


1.40E-31 


Citrus x paradisi 


37 


G1930 


7643366 


9.70E-57 


Medicago truncatula 


37 
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37 
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37 
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G1594 
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9.20E-62 


Dendrobium grex Madame Thong-In 


39 


G1594 


4887609 


1.50E-47 


Oryza sativa 


39 


G1594 


1814233 


4.00E-46 


Solanum tuberosum 


39 


G1594 
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6.30E-43 


Nicotiana tabacum 


39 


G1594 


4098241 


6.70E-43 


Lycopersicon esculentum 


39 
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2.00E-42 
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39 


G1594 
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39 


G1594 


4240538 


1.70E-41 


Zea mays 


39 


G1594 


1946219 


1.90E^1 
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41 


G391 


7209474 


4.70E-194 


Oryza sativa 


41 


G391 
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2.10E-145 
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41 
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41 
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41 
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7.00E-60 


Lycopersicon esculentum 


41 


G391 


8330425 


8.60E-59 


Mesembryanthemum crystallinum 


41 


G391 


8284059 


8.70E-57 


Glycine max 


41 


G391 


5269007 


8.10E-46 


Zea mays 


41 


G391 


9419425 


1.70E-43 


Hordeum vulqare 


41 


G391 


2963336 


2.10E-37 


Pinus taeda 
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5.80E-81 


Medicago truncatula 
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Figure 3E 
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7409018 
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43 
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3.00E-60 


Solanum tuberosum ... 
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43 


G390 
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Glycine max 


43 


G390 
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2.60E-44 
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43 


G390 
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4.60E-44 


Zea mays 


43 


G390 


2963336 


4.90E-43 


Pinus taeda 


45 


G1548 
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5.90E-169 


Oryza sativa 


45 


G1548 
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3.30E-140 


Physcomitrella patens 


45 


G1548 
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1.60E-76 


Solanum tuberosum 


. 45. 


G1548 


9820423 


1.40E-67 


Glycine max 


45 


G1548 


7570825 


8.40E-67 
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45 


G1548 
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2.70E-55 


Lycopersicon esculentum 


45 


G1548 
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1.40E-47 


Hordeum vulgare 


45 


G1548 
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3.50E-46 


Zea mays 


45 


G1548 


8330425 


4.20E-46 


Mesembryanthemum crystallinum 


45 


G1548 


3853847 


2.70E-42 


Populus tremula x Populus tremuloides 
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aag age gtt gtg ttg aaa gga gtg act cac ggt gca gtt gat tac etc 

Lys Ser Val Val Leu Lys Gly Val Thr His Gly Ala Val Asp Tyr Leu 

120 125 130 



Page 1 



WO 01/36444 PCT/US00/31325 

MBI0018 Sequence Listing. ST25 

ate aaa ccg gta cgt att gag get ttg aag aat ata tgg caa cat gtg 486 

lie Lys Pro Val Arg lie Glu Ala Leu Lys Asn lie Trp Gin His Val 
135 140 145 

gtg egg aag aag cgt aac gag tgg aat gtt tct gaa cat tct gga gga 534 

Val Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu His Ser Gly Gly 
150 155 160 

agt att gaa gat act ggc ggt gac agg gac agg cag cag cag cat agg 582 

Ser lie Glu Asp Thr Gly Gly Asp Arg Asp Arg Gin Gin Gin His Arg 

165 170 175 180 

gag gat get gat aac aac teg tct tea gtt aat gaa ggg aac ggg agg 630 

Glu Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu Gly Asn Gly Arg 

185 190 195 

age teg agg aag egg aag gaa gag gaa gta gat gat caa ggg gat gat 678 

Ser Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp Gin Gly Asp Asp 
200 205 210 

aag gaa gac tea teg agt tta aag aaa cca cgc gtg gtt tgg tct gtt 726 

Lys Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val 
215 220 225 

gaa ttg cat cag cag ttt gtt get. get gtg aat cag eta ggc gtt gac 774 

Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin Leu Gly Val Asp 
230 235 240 

agt gag tta aaa act tgc ttg ctt atg cat ttg tgt gtg teg att ggt 822 

Ser Glu Leu Lys Thr Cys Leu Leu Met His Leu Cys Val Ser lie Gly 

245 250 255 260 

aac att gtg gaa ttc cag aag tat egg ata tat ctg aga egg ctt gga 870 

Asn lie Val Glu Phe Gin Lys Tyr Arg lie Tyr Leu Arg Arg Leu Gly 

265 270 275 

gga gta teg caa cac caa gga aat atg aac cat teg ttt atg act ggt 918 

Gly Val Ser Gin His Gin Gly Asn Met Asn His Ser Phe Met Thr Gly 
280 285 290 

caa gat cag agt ttt gga cct ctt tct teg ttg aat gga ttt gat ctt 966 

Gin Asp Gin Ser Phe Gly Pro Leu Ser Ser Leu Asn Gly Phe Asp Leu 
295 300 305 

caa tct tta get gtt act ggt cag etc cct cct cag age ctt gca cag 1014 

Gin Ser Leu Ala Val Thr Gly Gin Leu Pro Pro Gin Ser Leu Ala Gin 
310 315 320 

ctt caa gca get ggt ctt ggc egg cct aca etc get aaa cca ggg atg 1062 

Leu Gin Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala Lys Pro Gly Met 

325 330 335 340 

teg gtt tct ccc ctt gta gat cag aga age ate ttc aac ttt gaa aac 1110 

.Ser Val Ser Pro Leu Val Asp Gin Arg Ser lie Phe Asn Phe Glu Asn 

345 350 355 

cca aaa ata aga ttt gga gac gga cat ggt cag acg atg aac aat gga 1158 

Pro Lys lie Arg Phe Gly Asp Gly His Gly Gin Thr Met Asn Asn Gly 
360 365 ^ 370 

aat ttg ctt cat ggt gtc cca acg ggt agt cac atg cgt ctg cgt cct 1206 

Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met Arg Leu Arg Pro 
375 380 385 

gga cag aat gtt cag age age gga atg atg ttg cca gta gca gac cag 1254 

Gly Gin Asn Val Gin Ser Ser Gly Met Met Leu Pro Val Ala Asp Gin 
390 395 400 

eta cct cga gga gga cca teg atg eta cca tec etc ggg caa cag ccg 1302 

Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu Gly Gin Gin Pro 

405 410 415 420 

ata ttg tea age age gtt tea aga aga age gat etc act ggt gcg ctg 13 50 

lie Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu Thr Gly Ala Leu 
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425 430 435 

gcg gtt aga aac agt ate ccc gag acc aac age aga gtg tta cca act 13 98 
Ala Val Arg Asn Ser lie Pro Glu Thr Asn Ser Arg Val Leu Pro Thr 
440 445 450 

act cac teg gtc ttc aat aac ttc ccc gcg gat eta cct cgc age age 1446 
Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu Pro Arg Ser Ser 
455 460 465 

ttc ccg ttg gca agt gec cca ggg att tea gtt cca gta tea gtt tct 1494 
Phe Pro Leu Ala Ser Ala Pro Gly He Ser Val Pro Val Ser Val Ser 
470 475 480 

tac caa gaa gag gtc aac age teg gat gca aaa gga ggt tea tea get 1542 
Tyr Gin Glu Glu Val Asn Ser Ser Asp Ala Lys Gly Gly Ser Ser Ala 
485 490 495 500 

get act get gga ttt ggt aac cca age tac gac ata ttt aac gat ttt 1590 
Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp He Phe Asn Asp Phe 

505 510 515 

ccg cag cac caa cag cac aac aag aac ate age aat aaa eta aac gat 1638 
Pro Gin His Gin Gin His Asn Lys Asn He Ser Asn Lys Leu Asn Asp 
520 525 530 

tgg gat ctg egg aat atg gga ttg gtc ttc agt tec aat cag gac gca 1686 
Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser Asn Gin Asp Ala 
535 540 545 

gca act gca acc gca acc gca gca ttt tec act teg gaa gca tac tct 1734 
Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser Glu Ala Tyr Ser 
550 555 560 

teg tct tct acg cag aga aaa aga egg gaa acg gac gca aca gtt gtg 1782 
Ser Ser Ser Thr Gin Arg Lys Arg Arg Glu Thr Asp Ala Thr Val Val 
565 570 575 580 

ggt gag cat ggg cag aac ctg cag tea ccg age egg aat ctg tat cat 1830 
Gly Glu His Gly Gin Asn Leu Gin Ser Pro Ser Arg Asn Leu Tyr His 

585 590 595 

ctg aac cac gtt ttt atg gac ggt ggt tea gtc aga gtg aag tea gaa 1878 
Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg Val Lys Ser Glu 
600 605 610 

aga gtg gcg gag aca gtg act tgt cct cca gca aat aca ttg ttt cac 1926 
Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn Thr Leu Phe His 
615 620 625 

gag cag tat aat caa gaa gat ctg atg age gca ttt etc aaa cag gtt 1974 
Glu Gin Tyr Asn Gin Glu Asp Leu Met Ser Ala Phe Leu Lys Gin Val 
630 635 640 

tga ttattactcg aatacagtgc actctaaaac 2007 

<210> 2 
<211> 644 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Val Asn Pro Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala 
15 10 15 

Gly Gly Ser Asn Ser Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val 
20 25 30 

Val Asp Asp Asp Pro Thr Cys Leu Met lie Leu Glu Arg Met Leu Met 
35 40 45 
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Thr Cys Leu Tyr Arg Glu Gin Arg Ala His Cys Leu Cys Phe Gly Arg 
50 55 60 

Thr Lys Asn Gly Phe Asp lie Val lie Ser Asp Val His Met Pro Asp 
65 70 75 80 

. Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu 

85 90 95 

Pro Val lie Asn Leu Asn Val Leu Lys Pro Leu Val He Val Met Ser 
100 105 110 

Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val Thr His Gly Ala 
. 115 120 125 

Val Asp Tyr Leu He Lys Pro Val Arg He Glu Ala Leu Lys Asn He 
130 135 140 

Trp Gin His Val Val Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu 
145 150 155 160' 

His Ser Gly Gly Ser He Glu Asp Thr Gly Gly Asp Arg Asp Arg Gin 
165 170 175 

Gin Gin His Arg Glu Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu 
180 185 190 

Gly Asn Gly Arg Ser Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp 
195 200 205 

Gin Gly Asp Asp Lys Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val 
210 215 220 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
225 230 235 240 

Leu Gly Val Asp Ser Glu Leu Lys Thr Cys Leu Leu Met His Leu Cys 
245 250 255 

Val Ser He Gly Asn lie Val Glu Phe Gin Lys Tyr Arg He Tyr Leu 
260 265 270 

Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn Met Asn His Ser 
275 280 285 

Phe Met Thr Gly Gin Asp Gin Ser Phe Gly Pro Leu Ser Ser Leu Asn 
290 295 300 

Gly Phe Asp Leu Gin Ser Leu Ala Val Thr Gly Gin Leu Pro Pro Gin 
305 310 315 320 

Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala 
325 330 335 

Lys Pro Gly Met Ser Val Ser Pro Leu Val Asp Gin Arg Ser He Phe 
340 345 350 
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Asn Phe Glu Asn Pro Lys lie Arg Phe Gly Asp Gly His Gly Gin Thr 
355 360 365 

Met Asn Asn Gly Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met 
370 375 380 

Arg Leu Arg Pro Gly Gin Asn Val Gin Ser Ser Gly Met Met Leu Pro 
385 390 395 400 

Val Ala Asp Gin Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu 
405 410 415 

Gly Gin Gin Pro lie Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu 
420 425 430 

Thr Gly Ala Leu Ala Val Arg Asn Ser He Pro Glu Thr Asn Ser Arg 
435 440 445 

Val Leu Pro Thr Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu 
450 455 460 

Pro Arg Ser Ser Phe Pro Leu Ala Ser Ala Pro Gly He Ser Val Pro 
465 470 475 480 

Val Ser Val Ser Tyr Gin Glu Glu Val Asn Ser Ser Asp Ala Lys Gly 
485 490 495 

Gly Ser Ser Ala Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp He 
500 505 510 

Phe Asn Asp Phe Pro Gin His Gin Gin His Asn Lys Asn He Ser Asn 
515 520 525 

Lys Leu Asn Asp Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser 
530 535 540 

Asn Gin Asp Ala Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser 
545 550 555 560 

Glu Ala Tyr Ser Ser Ser Ser Thr Gin Arg Lys Arg Arg Glu Thr Asp 
565 570 575 

Ala Thr Val Val Gly Glu His Gly Gin Asn Leu Gin Ser Pro Ser Arg 
580 585 590 

Asn Leu Tyr His Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg 
595 600 605 

Val Lys Ser Glu Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn 
610 615 620 

Thr Leu Phe His Glu Gin Tyr Asn Gin Glu Asp Leu Met Ser Ala Phe 
625 630 635 640 



Leu Lys Gin Val 
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<210> 


3 


<211> 


834 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(73) . . (588) 


<223> 


G732 


<4O0> 


3 



aaaaaaacca aacataaaac ataaaactct gtcctttttt tgtcttcttg taacttttct 60 

tgttaaaaat ca atg gcg tea tct age age aca tac egg age tea age tct 111 
Met Ala Ser Ser Ser Ser Thr Tyr Arg Ser Ser Ser Ser 
15 10 

tec gac ggt ggt aat aat aac ccg teg gac tec gtc gtc acc gtc gac 159 
Ser Asp Gly Gly Asn Asn Asn Pro Ser Asp Ser Val Val Thr Val Asp 
15 20 25 

gaa cga aaa cgt aaa aga atg tta teg aac aga gaa tct gca cgt agg 207 
Glu Arg Lys Arg Lys Arg Met Leu Ser Asn Arg Glu Ser Ala Arg Arg 
30 35 40 45 

tea agg atg cgt aaa cag aaa cac gtt gat gat eta acg get cag ate 255 
Ser Arg Met Arg Lys Gin Lys His Val Asp Asp Leu Thr Ala Gin lie 
50 55 60 

aat cag eta tea aac gac aac cgt cag ate ttg aac age etc acc gta 3 03 

Asn Gin Leu Ser Asn Asp Asn Arg Gin lie Leu Asn Ser Leu Thr Val 
65 70 75 

aca tct cag ctt tac atg aag ate caa gee gag aac tct gtt etc acc 351 
Thr Ser Gin Leu Tyr Met Lys lie Gin Ala Glu Asn Ser Val Leu Thr 
80 85 90 

get cag atg gag gag ctt age acc aga etc caa tct etc aac gag ate 399 
Ala Gin Met Glu Glu Leu Ser Thr Arg Leu Gin Ser Leu Asn Glu lie 
95 100 105 

gtt gat ctt gtt caa tec aac ggt gca gga ttt ggt gtt gac cag ate 447 
Val Asp Leu Val Gin Ser Asn Gly Ala Gly Phe Gly Val Asp Gin He 
110 115 120 125 

gac ggc tgt ggt ttt gat gat cgt acg gtt ggg ate gac gga tat tac 495 
Asp Gly Cys Gly Phe Asp Asp Arg Thr Val Gly He Asp Gly Tyr Tyr 
130 135 140 

gat gat atg aat atg atg agt aat gtt aat cat tgg ggt ggt teg gtt 543 
Asp Asp Met Asn Met Met Ser Asn Val Asn His Trp Gly Gly Ser Val 
145 150 155 

tac act aac caa ccc att atg get aat gat ate aat atg tat tga 588 
Tyr Thr Asn Gin Pro He Met Ala Asn Asp He Asn Met Tyr 
160 165 170 



ttaataaaat 


taattaaaat 


aattagatgc 


cccttttttg 


tctttttatt 


ttaaaattta 


648 


geccattttg 


gtgtttttgg 


gttggtgtga 


tgatgtaatt 


atagtacatg 


catctttgat 


708 


tggttggaag 


gataaatata 


aactttatat 


atatattggg 


geatatatat 


atgagttgta 


768 


etttgeatgt 


attggtgtgt 


gttttgttat 


aattatatga 


ttatatatgt 


ttatgttaaa 


828 


aaaaaa 












834 



<210> 4 
<211> 171 
<212> PRT 
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<213> Arabidopsis thaliana 
<400> 4 

Met Ala Ser Ser Ser Ser Thr Tyr Arg Ser Ser Ser Ser Ser Asp Gly 
15 10 15 



Gly Asn Asn Asn Pro Ser Asp Ser Val Val Thr Val Asp Glu Arg Lys 
20 25 30 



Arg Lys Arg Met Leu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg Met 
35 40 45 



Arg Lys Gin Lys His Val Asp" Asp Leu Thr Ala Gin lie Asn Gin Leu 
50 55 ' 60 



Ser Asn Asp Asn Arg Gin lie Leu Asn Ser Leu Thr Val Thr Ser Gin 
65 70 75 80 

Leu Tyr Met Lys lie Gin Ala Glu Asn Ser Val Leu Thr Ala Gin Met 
85 90 95 



Glu Glu Leu Ser Thr Arg Leu Gin Ser Leu Asn Glu lie Val Asp Leu 
100 105 110 



Val Gin Ser Asn Gly Ala Gly Phe Gly Val Asp Gin lie Asp Gly Cys 
115 120 125 



Gly Phe Asp Asp Arg Thr Val Gly lie Asp Gly Tyr Tyr Asp Asp Met 
130 135 140 

Asn Met Met Ser Asn Val Asn His Trp Gly Gly Ser Val Tyr Thr Asn 
145 150 155 160 



Gin Pro lie Met Ala Asn Asp lie Asn Met Tyr 
165 170 



<210> 


5 


<211> 


1246 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(81) . . (1139) 


<223> 


G9 


<400> 


5 



gtgtttcttc tttctgctaa aaggttataa tttttgtttc ttggtttggt gagaatcttc 60 

aagaaactga aacaaagaaa atg gat tct agt tgc ata gac gag ata agt tec 113 

Met Asp Ser Ser Cys lie Asp Glu He Ser Ser 
15 10 

tec act tea gaa tct ttc tec gee ace ace gee aag aag etc tct cct 161 
Ser Thr Ser Glu Ser Phe Ser Ala Thr Thr Ala Lys Lys Leu Ser Pro 
15 20 25 

cct ccc gcg gcg gcg tta cgc etc tac egg atg gga age ggc ggg age 209 
Pro Pro Ala Ala Ala Leu Arg Leu Tyr Arg Met Gly Ser Gly Gly Ser 
30 35 40 

age gtc gtg ttg gat ccc gag aac ggc eta gag acg gag tea cga aag 257 
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Ser Val Val Leu Asp Pro Glu Asn Gly Leu Glu Thr Glu Ser Arg Lys 
45 50 55 

eta cca tct tea aaa tac aaa ggt gtt gtt cct cag cct aac gga aga 305 
Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro Asn Gly Arg 
60 65 70 75 

tgg gga get cag ate tac gag aag cac caa cga gta tgg etc ggg act 353 
Trp Gly Ala Gin lie Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr 
80 85 90 

ttc aac gag caa gaa gaa get get cgt tec tac gac ate gca get tgt 401 
Phe Asn Glu Gin Glu Glu Ala Ala Arg Ser Tyr Asp lie Ala Ala Cys 
95 100 105 

aga ttc cgt ggc cgc gac gee gtc gtc aac ttc aag aac gtt ctg gaa 449 
Arg Phe Arg Gly Arg Asp Ala Val Val Asn Phe Lys Asn Val Leu Glu 
110 115 120 

gac ggc gat tta get ttt ctt gaa get cac tea aag gee gag ate gtc 497 
Asp Gly Asp Leu Ala Phe Leu Glu Ala His Ser Lys Ala Glu lie Val 
125 130 135 

gac atg ttg aga aaa cac act tac gee gac gag ctt gaa cag aac aat 545 
Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Leu Glu Gin Asn Asn 
140 145 150 155 

aaa egg cag ttg ttt etc tec gtc gac get aac gga aaa cgt aac gga 593 
Lys Arg Gin Leu Phe Leu Ser Val Asp Ala Asn Gly Lys Arg Asn Gly 
160 165 170 

teg agt act act caa aac gac aaa gtt tta aag acg tgt gaa gtt ctt 641 
Ser Ser Thr Thr Gin Asn Asp Lys Val Leu Lys Thr Cys Glu Val Leu 
175 180 185 

ttc gag aag get gtt aca cct age gac gtt ggg aag eta aac cgt etc 689 
Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu 
190 195 200 

gtg ata cct aaa caa cac gee gag aaa cac ttt ccg tta ccg tea ccg 73 7 

Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu Pro Ser Pro 
205 210 215 

tea ccg gca gtg act aaa gga gtt ttg ate aac ttc gaa gac gtt aac 785 
Ser Pro Ala Val Thr Lys Gly Val Leu lie Asn Phe Glu Asp Val Asn 
220 225 230 235 

ggt aaa gtg tgg agg ttc cgt tac tea tac tgg aac agt agt caa agt 833 
Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser 
240 245 250 

tac gtg ttg acc aag gga tgg agt cga ttc gtc aag gag aag aat ctt 881 
Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu 
255 260 265 

cga gee ggt gat gtt gtt act ttc gag aga teg acc gga eta gag egg 929 
Arg Ala Gly Asp Val Val Thr Phe Glu Arg Ser Thr Gly Leu Glu Arg 
270 275 280 

cag tta tat att gat tgg aaa gtt egg tct ggt ccg aga gaa aac ccg 977 
Gin Leu Tyr lie Asp Trp Lys Val Arg Ser Gly Pro Arg Glu Asn Pro 
285 290 295 

gtt cag gtg gtg gtt egg ctt ttc gga gtt gat ate ttt aat gtg acc 1025 
Val Gin Val Val Val Arg Leu Phe Gly Val Asp lie Phe Asn Val Thr 
300 305 310 315 

acc gtg aag cca aac gac gtc gtg gee gtt tgc ggt gga aag aga tct 1073 
Thr Val Lys Pro Asn Asp Val Val Ala Val Cys Gly Gly Lys Arg Ser 
320 325 330 

cga gat gtt gat gat atg ttt gcg tta egg tgt tec aag aag cag gcg 1121 
Arg Asp Val Asp Asp Met Phe Ala Leu Arg Cys Ser Lys Lys Gin Ala 
335 340 345 
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ata ate aat get ttg tga catatttcct tttccgattt tatgettteg 1169 
He He Asn Ala Leu 
350 

ttttttaatt tttttttttg tcaagttgtg taggttgtga ttcatgetag gttgtattta 1229 
ggaaaagaga taagacc 124 6 

<210> 6 

<211> 352 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 6 

Met Asp Ser Ser Cys He Asp Glu lie Ser Ser Ser Thr Ser Glu Ser 

1 : 5 ■ - 10 15 

Phe Ser Ala Thr Thr Ala Lys Lys Leu Ser Pro Pro Pro Ala Ala Ala 
20 25 30 

Leu Arg Leu Tyr Arg Met Gly Ser Gly Gly Ser Ser Val Val Leu Asp 
35 40 45 

Pro Glu Asn Gly Leu Glu Thr Glu Ser Arg Lya Leu Pro Ser Ser Lys 
50 55 60 

Tyr Lys Gly Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He 
65 70 75 80 

Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Gin Glu 
85 90 95 

Glu Ala Ala Arg Ser Tyr Asp He Ala Ala Cys Arg Phe Arg Gly Arg 
100 105 110 

Asp Ala Val Val Asn Phe Lys Asn Val Leu Glu Asp Gly Asp Leu Ala 
115 120 125 

Phe Leu Glu Ala His Ser Lys Ala Glu He Val Asp Met Leu Arg Lys 
130 135 140 

His Thr Tyr Ala Asp Glu Leu Glu Gin Asn Asn Lys Arg Gin Leu Phe 
145 150 155 160 

Leu Ser Val Asp Ala Asn Gly Lys Arg Asn Gly Ser Ser Thr Thr Gin 
165 170 175 

Asn Asp Lys Val Leu Lys Thr Cys Glu Val Leu Phe Glu Lys Ala Val 
180 185 190 

Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val lie Pro Lys Gin 
195 200 205 

His Ala Glu Lys His Phe Pro Leu Pro Ser Pro Ser Pro Ala Val Thr 
210 215 220 

Lys Gly Val Leu He Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg 
225 230 235 240 
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Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys 
245 250 255 

Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val 
260 265 270 

Val Thr Phe Glu Arg Ser Thr Gly Leu Glu Arg Gin Leu Tyr He Asp 
275 280 285 

Trp Lys Val Arg Ser Gly Pro Arg Glu Asn Pro Val Gin Val Val Val 
290 295 300 

Arg Leu Phe Gly Val Asp He Phe Asn Val Thr Thr Val Lys Pro Asn 
305 310 315 320 

Asp Val Val Ala Val Cys Gly Gly Lys Arg Ser Arg Asp Val Asp Asp 
325 330 335 

Met Phe Ala Leu Arg Cys Ser Lys Lys Gin Ala He lie Asn Ala Leu 

345 350 





340 


<210> 


7 


<211> 


1379 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(97) . . (1032) 


<223> 


G428 


<400> 


7 



ttacttttgt gtttcttcat attcttcaga agcaagcaca aggctaggga tcgaagaagc 60 

ggcgatcact gatcgtatct cactacgatc acatta atg gat aga atg tgt ggt 114 

Met Asp Arg Met Cys Gly 

1 5 

ttc cgc teg acg gaa gac tat teg gag aaa gcg acg ttg atg atg ccg 162 

Phe Arg Ser Thr Glu Asp Tyr Ser Glu Lys Ala Thr Leu Met Met Pro 

10 15 20 

tec gat tat cag tct ttg att tgt tea ace acc gga gac aat caa aga 210 
Ser Asp Tyr Gin Ser Leu He Cys Ser Thr Thr Gly Asp Asn Gin Arg 
25 30 35 

ctg ttt gga tec gac gaa etc get acc get ttg tec teg gag ttg ctt 258 
Leu Phe Gly Ser Asp Glu Leu Ala Thr Ala Leu Ser Ser Glu Leu Leu 
40 45 50 

ccg cgt att cga aaa get gag gat aat ttc tct ctt agt gtc ate aaa 306 
Pro Arg He Arg Lys Ala Glu Asp Asn Phe Ser Leu Ser Val He Lys 
55 60 65 70 

tec aaa ate get tct cat cct ttg tat cct cgc tta etc caa acc tac 354 
Ser Lys He Ala Ser His Pro Leu Tyr Pro Arg Leu Leu Gin Thr Tyr 
75 80 85 

ate gat tgc caa aag gtg gga gcg cct atg gaa ata gcg tgt ata ttg 402 
He Asp Cys Gin Lys Val Gly Ala Pro Met Glu He Ala Cys He Leu 
90 95 100 

gaa gag att cag cga gag aac cat gtg tac aag aga gat gtt get cca 450 
Glu Glu He Gin Arg Glu Asn His Val Tyr Lys Arg Asp Val Ala Pro 
105 110 115 
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tea tct tgc ctt gga get gat cct gag ctt gat gaa ttc atg gaa acc 
Leu Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe Met Glu Thr 
120 125 130 



498 



tac tgt gat ata ttg gtt aaa tac aaa acc gat ctt gcg agg ccg ttc 
Tyr Cys Asp lie Leu Val Lys Tyr Lys Thr Asp Leu Ala Arg Pro Phe 
135 140 145 150 



546 



gac gag get aca act ttc ata aac aag att gaa atg cag ctt cag aac 
Asp Glu Ala Thr Thr Phe He Asn Lys He Glu Met Gin Leu Gin Asn 
155 160 165 



594 



ttg tgc act ggt cca gcg tct get aca get ctt tea gat gat ggt gcg 
Leu Cys Thr Gly Pro Ala Ser Ala Thr Ala Leu Ser Asp Asp Gly Ala 
170 175 • 180 



642 



gtt tea tct gac gag gaa ctg aga gaa gat gat gac ata gca gcg gat 690 

val Ser Ser Asp Glu Glu Leu Arg Glu Asp Asp Asp lie Ala Ala Asp 

185 190 195 

gac age caa caa aga age aat gac cgc gat ctg aag gac cag eta eta 73 8 

Asp Ser Gin Gin Arg Ser Asn Asp Arg Asp Leu Lys Asp Gin Leu Leu 

200 205 210 

cgc aaa ttt ggt age cat ate agt tea ttg aaa etc gag ttc tct aaa 786 

Arg Lys Phe Gly Ser His He Ser Ser Leu Lys Leu Glu Phe Ser Lys 

215 220 225 230 

aag aag aag aaa ggg aag eta cca aga gaa gca aga caa gcg ttg etc 834 

Lys Lys Lys Lys Gly Lys Leu Pro Arg Glu Ala Arg Gin Ala Leu Leu 
235 240 245 

gat tgg tgg aat gtt cat aat aaa tgg cct tac cct act gaa ggc gac 882 

Asp Trp Trp Asn Val His Asn Lys Trp Pro Tyr Pro Thr Glu Gly Asp 
250 255 260 

aaa ata get ctg get gaa gaa aca ggt ttg gat caa aaa caa ate aac 930 

Lys He Ala Leu Ala Glu Glu Thr Gly Leu Asp Gin Lys Gin He Asn 

265 270 275 

aat tgg ttt ata aac caa agg aaa cgc cat tgg aag cct teg gag aac 978 

Asn Trp Phe He Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asn 

280 285 290 

atg ccg ttt gat atg atg gac gat tct aat gaa aca ttc ttt acc gag 1026 

Met Pro Phe Asp Met Met Asp Asp Ser Asn Glu Thr Phe Phe Thr Glu 

295 300 305 310 

gaa tga aaagagagac atgggattgt gcattgtata atttttacac tgttttccca 1082 
Glu 



agaaaagaaa 


acagtaaaaa 


gcttttggta 


aatgggacat 


catcgegaat 


gaatggaacc 


1142 


agttagccaa 


aacggtcaag 


ggcgtggcgt 


aacgagacat 


tgtattggaa 


atagtggcaa 


1202 


tattatgtca 


ctaatcttcc 


aatggtccaa 


aatgatagat 


ttcttatttg 


tattgaacct 


1262 


tacttagata 


gctgatgtgt 


caactaaata 


atttattttc 


atccttatac 


tacttgtatc 


1322 


aatgtctcta 


attgatcaat 


tgttgcttgc 


tattcaaaaa 


aaaaaaaaaa 


aaaaaaa 


1379 



<210> 8 
<211> 311 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Asp Arg Met Cys Gly Phe Arg Ser Thr Glu Asp Tyr Ser Glu Lys 
15 10 15 



Ala Thr Leu Met Met Pro Ser . Asp Tyr Gin Ser Leu He Cys Ser Thr 
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20 25 30 



Thr Gly Asp Asn Gin Arg Leu Phe Gly Ser Asp Glu Leu Ala Thr Ala 
35 40 45 



Leu Ser Ser Glu Leu Leu Pro Arg lie Arg Lys Ala Glu Asp Asn Phe 
50 55 60 



Ser Leu Ser Val lie Lys Ser Lys He Ala Ser His Pro Leu Tyr Pro 
65 70 75 80 



Arg Leu Leu Gin Thr Tyr He Asp Cys Gin Lys Val Gly Ala Pro Met 

85-90 95 



Glu He Ala Cys He Leu Glu Glu He Gin Arg Glu Asn His Val Tyr 
100 105 110 



Lys Arg Asp Val Ala Pro Leu Ser Cys Phe Gly Ala Asp Pro Glu Leu 
115 120 125 



Asp Glu Phe Met Glu Thr Tyr Cys Asp He Leu Val Lys Tyr Lys Thr 
130 135 140 



Asp Leu Ala Arg Pro Phe Asp Glu Ala Thr Thr Phe He Asn Lys He 
145 150 155 160 



Glu Met Gin Leu Gin Asn Leu Cys Thr Gly Pro Ala Ser Ala Thr Ala 

165 170 175 



Leu Ser Asp Asp Gly Ala Val Ser Ser Asp Glu Glu Leu Arg Glu Asp 
180 185 190 



Asp Asp He Ala Ala Asp Asp Ser Gin Gin Arg Ser Asn Asp Arg Asp 
195 200 205 



Leu Lys Asp Gin Leu Leu Arg Lys Phe Gly Ser His He Ser Ser Leu 
210 215 220 



Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg Glu 
225 230 235 240 



Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Val His Asn Lys Trp Pro 

245 250 255 



Tyr Pro Thr Glu Gly Asp Lys He Ala Leu Ala Glu Glu Thr Gly Leu 
260 265 270 



Asp Gin Lys Gin He Asn Asn Trp Phe He Asn Gin Arg Lys Arg His 
275 280 285 



Trp Lys Pro Ser Glu Asn Met Pro Phe Asp Met Met Asp Asp Ser Asn 
290 295 300 



Glu Thr Phe Phe Thr Glu Glu 
305 310 
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<210> 


9 


<211> 


1571 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(428) . . (1402) 


<223> 


G869 


<400> 


9 



aggaacagtg aaaggttcgg ttttttgggt ttcgatctga taatcaacaa gaaaaaaggg 60 

tttgatttat gtcggctggg tttgaatcga ctgtgatttt gtctttgatt catatctctt 120 

ctccgatttc atcatcatct tccccatcat cgtcgtcttt gaaatcttgt cttctcaacg 180 

ctcttcactt ctgctgtaat aagcagaggc ttgttctgga gactccttct ctttccatgc 240 

gcttaagacc caaaaggact tgttctagtg ttgaagtctt tgggggtttt cacataaagc 300 

agcaaaagtt ttcttttttc atagttcgct gagagttttg agttttgata ccaaaaaagt 360 

tttgaccttt tagagtgatt ttttgttctt tctgttttct gggtattttt gaggagtggg 420 

tttaaca atg gtt gcg att aga aag gaa cag tct ttg agt ggt gtt agt 469 
Met Val Ala lie Arg Lys Glu Gin Ser Leu Ser Gly Val Ser 
1 5 10 

age gag att aag aag aga get aag aga aac act eta teg tec ctt cct 517 
Ser Glu lie Lys Lys Arg Ala Lys Arg Asn Thr Leu Ser Ser Leu Pro 
15 20 25 30 

caa gaa acc caa cct ttg agg aaa gtc cgt att att gtg aat gat cct 565 
Gin Glu Thr Gin Pro Leu Arg Lys Val Arg lie lie Val Asn Asp Pro 

35 40 45 

tat get act gat gat tec tct agt gat gag gaa gag ctt aag gtt cct 613 
Tyr Ala Thr Asp Asp Ser Ser Ser Asp Glu Glu Glu Leu Lys Val Pro 
50 55 60 



aag cca agg aaa atg aaa cgt ate gtt cgt gag att aac ttt cct tct 
Lys Pro Arg Lys Met Lys Arg lie Val Arg Glu lie Asn Phe Pro Ser 
65 70 75 



661 



atg gaa gtt tct gaa cag cct tct gag agt tct tct cag gac agt act 709 

Met Glu Val Ser Glu Gin Pro Ser Glu Ser Ser Ser Gin Asp Ser Thr 
80 85 90 

aaa act gat ggc aag ata get gtg tea get tct cct get gtt cct agg 757 

Lys Thr Asp Gly Lys lie Ala Val Ser Ala Ser Pro Ala Val Pro Arg 

95 100 105 110 

aag aag cct gtt ggt gtt agg caa agg aaa tgg ggg aaa tgg get get 805 

Lys Lys Pro Val Gly Val Arg Gin Arg Lys Trp Gly Lys Trp Ala Ala 

115 120 125 

gag att aga gat cct att aag aaa act agg act tgg ttg ggt act ttt 853 

Glu He Arg Asp Pro He Lys Lys Thr Arg Thr Trp Leu Gly Thr Phe 

130 135 140 

gat act ctt gaa gaa get get aaa get tat gat get aag aag ctt gag 901 

Asp Thr Leu Glu Glu Ala Ala Lys Ala Tyr Asp Ala Lys Lys Leu Glu 

145 150 155 

ttt gat get att gtt get gga aat gtg tec act act aaa cgt gat gtt 949 

Phe Asp Ala He Val Ala Gly Asn Val Ser Thr Thr Lys Arg Asp Val 
160 165 170 

tct tea tct gag act age caa tgc tct cgt tct tea cct gtt gtt cct 997 

Ser Ser Ser Glu Thr Ser Gin Cys Ser Arg Ser Ser Pro Val Val Pro 

175 180 185 190 

gtt gag caa gat gac act tct gca tea get etc act tgt gtc aac aac 1045 
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Val Glu Gin Asp Asp Thr Ser Ala Ser Ala Leu Thr Cys Val Asn Asn 
195 200 205 

cct gat gac gtc teg acc gtt get cca act get cca act cca aat gtt 1093 
Pro Asp Asp Val Ser Thr Val Ala Pro Thr Ala Pro Thr Pro Asn Val _ 
210 215 220 

cct get ggt gga aac aag gaa acg ttg ttc gat ttc gac ttt act aat 1141 
Pro Ala Gly Gly Asn Lys Glu Thr Leu Phe Asp Phe Asp Phe Thr Asn 
225 230 235 

eta cag ate cct gat ttt ggt ttc ttg gca gag gag caa caa gac eta 1189 
Leu Gin lie Pro Asp Phe Gly Phe Leu Ala Glu Glu Gin Gin Asp Leu 
240 245 250 

gac ttc gat tgt ttc etc gcg gat gat cag ttt gat gat ttc ggc ttg 1237 
Asp Phe Asp Cys Phe Leu Ala Asp Asp Gin Phe Asp Asp Phe Gly Leu 
255 260 265 270 

ctt gat gac att caa gga ttc gaa gat aac ggt cca agt gcg tta cca 1285 
Leu Asp Asp lie Gin Gly Phe Glu Asp Asn Gly Pro Ser Ala Leu Pro 
275 280 285 

gat ttc gac ttt gcg gat gtt gaa gat ctt cag eta get gac tct agt 1333 
Asp Phe Asp Phe Ala Asp Val Glu Asp Leu Gin Leu Ala Asp Ser Ser 
290 295 300 

ttc ggt ttc ctt gat caa ctt get cct ate aac ate tct tgc cca tta 1381 
Phe Gly Phe Leu Asp Gin Leu Ala Pro lie Asn lie Ser Cys Pro Leu 
305 310 315 

aaa agt ttt gca get tea tag gatcttgett agtaatgtta agtgagaaga 1432 
Lys Ser Phe Ala Ala Ser 
320 

gtgttttgtt ttttcgttta tgctttagta atttaagaca tacaaaagtg tgtgttccgg 1492 

attgtagtaa gatcttaaga cataaagecg ggttttgcaa ttaggaatcg agttttaatg 1552 

aagttttagt ttatgtttg 1571 

<210> 10 
<211> 324 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 10 

Met Val Ala He Arg Lys Glu Gin Ser Leu Ser Gly Val Ser Ser Glu 
15 10 15 

He Lys Lys Arg Ala Lys Arg Asn Thr Leu Ser Ser Leu Pro Gin Glu 
20 25 30 

Thr Gin Pro Leu Arg Lys Val Arg He He Val Asn Asp Pro Tyr Ala 
35 40 45 

Thr Asp Asp Ser Ser Ser Asp Glu Glu Glu Leu Lys Val Pro Lys Pro 
50 55 60 

Arg Lys Met Lys Arg He Val Arg Glu He Asn Phe Pro Ser Met Glu 
65 70 75 80 

Val Ser Glu Gin Pro Ser Glu Ser Ser Ser Gin Asp Ser Thr Lys Thr 
85 90 95 

Asp Gly Lys He Ala Val Ser Ala Ser Pro Ala Val Pro Arg Lys Lys 
100 105 110 
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Pro Val Gly Val Arg Gin Arg Lys Trp Gly Lys Trp Ala Ala Glu He 
115 120 125 



Arg Asp Pro He Lys Lys Thr Arg Thr Trp Leu Gly Thr Phe Asp Thr 
130 135 140 



Leu Glu Glu Ala Ala Lys Ala Tyr Asp Ala Lys Lys Leu Glu Phe Asp 
145 150 155 160 



Ala He Val Ala Gly Asri Val Ser Thr Thr Lys Arg Asp Val Ser Ser 
165 170 175 



Ser Glu Thr Ser Gin Cys Ser Arg Ser Ser Pro Val Val Pro Val Glu 
180 185 190 



Gin Asp Asp Thr Ser Ala Ser Ala Leu Thr Cys Val Asn Asn Pro Asp 
195 200 205 

Asp Val Ser Thr Val Ala Pro Thr Ala Pro Thr Pro Asn Val Pro Ala 
210 215 220 

Gly Gly Asn Lys Glu Thr Leu Phe Asp Phe Asp Phe Thr Asn Leu Gin 
225 230 235 240 

lie Pro Asp Phe Gly Phe Leu Ala Glu Glu Gin Gin Asp Leu Asp Phe 
245 250 255 



Asp Cys Phe Leu Ala Asp Asp Gin Phe Asp Asp Phe Gly Leu Leu Asp 
260 265 270 

Asp He Gin Gly Phe Glu Asp Asn Gly Pro Ser Ala Leu Pro Asp Phe 
275 280 285 



Asp Phe Ala Asp Val Glu Asp Leu Gin Leu Ala Asp Ser Ser Phe Gly 
290 295 300 



Phe Leu Asp Gin Leu Ala Pro He Asn He Ser Cys Pro Leu Lys Ser 
305 310 315 320 



Phe Ala Ala Ser 



<210> 


11 


<211> 


1166 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(88) - - (951) 


<223> 


G1269 


<400> 


11 



aacaattctc cctctcctta ttcttcttct tcagcttcag atttcagatc ttaaatcttc 60 

aagtcttctt cttcttcttc tgcaacc atg get atg cag gaa cgt tgt gag agt 114 

Met Ala Met Gin Glu Arg Cys Glu Ser 
1 5 
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tta tgt tct gat gaa ctt ata tct tec tea gat gec ttt tac etc aag 162 
Leu Cys Ser Asp Glu Leu lie Ser Ser Ser Asp Ala Phe Tyr Leu Lys 
10 15 20 25 

aca aga aag cct tat acc ate act aaa caa aga gag aaa tgg aca gaa . v ' 210 
Thr Arg Lys Pro Tyr Thr lie Thr Lys Gin Arg Glu Lys Trp Thr Glu 
30 35 40 

gca gag cat gag aag ttt gta gaa gca ttg aaa etc tat ggc aga get 258 
Ala Glu His Glu Lys Phe Val Glu Ala Leu Lys Leu Tyr Gly Arg Ala 
45 50 55 

tgg aga cga ate gaa gaa cat gtt gga aca aaa act gca gtt cag att 306 
Trp Arg Arg He Glu Glu His Val Gly Thr Lys Thr Ala Val Gin He 
60 . 65 70 

cga age cat gcg cag aag ttc ttt act aag gtt get cgc gat ttt ggt 354 
Arg Ser His Ala Gin Lys Phe Phe Thr Lys Val Ala Arg Asp Phe Gly 
75 80 85 

gtt age tct gag tec att gag ate ccg cct cca agg cca aag aga aag 402 
Val Ser Ser Glu Ser He Glu He Pro Pro Pro Arg Pro Lys Arg Lys 
90 95 100 105 

ccg atg cat cct tac cct aga aag ctt gtg att cct gat gca aaa gag 450 
Pro Met His Pro Tyr Pro Arg Lys Leu Val He Pro Asp Ala Lys Glu 
110 115 120 

atg gta tac get gaa eta acc gga tec aag ctg att cag gat gaa gat 498 
Met Val Tyr Ala Glu Leu Thr Gly Ser Lys Leu lie Gin Asp Glu Asp 
125 130 135 

aac cga tct cca aca teg gtt tta tea get cat ggc tea gat gga tta 546 
Asn Arg Ser Pro Thr Ser Val Leu Ser Ala His Gly Ser Asp Gly Leu 
140 145 150 

ggt tec att ggt tea aat tea cct aac tct tct tea get gag tta tea 594 
Gly Ser He Gly Ser Asn Ser Pro Asn Ser Ser Ser Ala Glu Leu Ser 
155 160 165 

tct cac aca gag gaa tea ttg tct eta gaa gca gag acc aaa cag age 642 
Ser His Thr Glu Glu Ser Leu Ser Leu Glu Ala Glu Thr Lys Gin Ser 
170 175 180 185 

ctt aag etc ttt gga aaa act ttt gta gtt ggt gat tac aac tct tea 690 
Leu Lys Leu Phe Gly Lys Thr Phe Val Val Gly Asp Tyr Asn Ser Ser 
190 195 200 

atg agt tgt gat gat tct gaa gat ggc aag aag aag eta tac tea gaa 738 
Met Ser Cys Asp Asp Ser Glu Asp Gly Lys Lys Lys Leu Tyr Ser Glu 
205 210 215 

aca cag tct ctt caa tgt tct tct tct act tea gaa aac get gaa aca 786 
Thr Gin Ser Leu Gin Cys Ser Ser Ser Thr Ser Glu Asn Ala Glu Thr 
220 225 230 

gaa gtg gta gtg teg gag ttc aaa aga agt gag aga tea get ttc tct 834 
Glu Val Val Val Ser Glu Phe Lys Arg Ser Glu Arg Ser Ala Phe Ser 
235 240 245 

cag tta aaa teg teg gtg act gag atg aac aac atg aga ggg ttc atg 882 
Gin Leu Lys Ser Ser Val Thr Glu Met Asn Asn Met Arg Gly Phe Met 
250 255 260 265 

cct tac aaa aag aga gta aag gtg gaa gaa aac att gac aat gta aaa 930 
Pro Tyr Lys Lys Arg Val Lys Val Glu Glu Asn He Asp Asn Val Lys 
270 275 280 

tta tea tat cct ttg tgg tga agtgttcgtt tgtgtcaagt cagttgtgta 981 
Leu Ser Tyr Pro Leu Trp 
285 

aactcttttg atctcaacat cagattatgt gtataatgtc agagtattag ggaaagtttt 1041 



Page 16 



WO 01/36444 



PCTAJS00/31325 



MBI0018 Sequence Listing. ST25 
tttggattag attcgtaaga tcactccaaa gtttcgtgcc tttccatata accagttaga 1101 

aattgagatc cttgtactta aacattttta tttgatcaat caaatcttct tgatgaaaaa 1161 

aaaaa 1166 

<210> 12 

<211> 287 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 12 

Met Ala Met Gin Glu Arg Cys Glu Ser Leu Cys Ser Asp Glu Leu lie 
1 5 10 15 

Ser Ser Ser Asp Ala Phe Tyr Leu Lys Thr Arg Lys Pro Tyr Thr He 
20 25 30 

Thr Lys Gin Arg Glu Lys Trp Thr Glu Ala Glu His Glu Lys Phe Val 
35 40 45 

Glu Ala Leu Lys Leu Tyr Gly Arg Ala Trp Arg Arg He Glu Glu His 
50 55 60 

Val Gly Thr Lys Thr Ala Val Gin He Arg Ser His Ala Gin Lys Phe 
65 70 75 80 

Phe Thr Lys Val Ala Arg Asp Phe Gly Val Ser Ser Glu Ser He Glu 
85 90 95 

He Pro Pro Pro Arg Pro Lys Arg Lys Pro Met His Pro Tyr Pro Arg 
100 105 110 

Lys Leu Val He Pro Asp Ala Lys Glu Met Val Tyr Ala Glu Leu Thr 
115 120 125 

Gly Ser Lys Leu He Gin Asp Glu Asp Asn Arg Ser Pro Thr Ser Val 
130 135 140 

Leu Ser Ala His Gly Ser Asp Gly Leu Gly Ser He Gly Ser Asn Ser 
145 150 155 160 

Pro Asn Ser Ser Ser Ala Glu Leu Ser Ser His Thr Glu Glu Ser Leu 
165 170 175 

Ser Leu Glu Ala Glu Thr Lys Gin Ser Leu Lys Leu Phe Gly Lys Thr 
180 185 190 

Phe Val Val Gly Asp Tyr Asn Ser Ser Met Ser Cys Asp Asp Ser Glu 
195 200 205 

Asp Gly Lys Lys Lys Leu Tyr Ser Glu Thr Gin Ser Leu Gin Cys Ser 
210 215 220 

Ser Ser Thr Ser Glu Asn Ala Glu Thr Glu Val Val Val Ser Glu Phe 
225 230 235 240 



Lys Arg Ser Glu Arg Ser Ala Phe Ser Gin Leu Lys Ser Ser Val Thr 
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245 250 255 

Glu Met Asn Asn Met Arg Gly Phe Met Pro Tyr Lys Lys Arg Val Lys 
260 265 270 

Val Glu Glu Asn lie Asp Asn Val Lys Leu Ser Tyr Pro Leu Trp 

280 285 





275 


<210> 


13 


<211> 


2031 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS ■ 


<222> 


(240) . . (1574) 


<223> 


G1038 


<400> 


13 



gctcgttttc aaattaaaaa cagggagaaa tttggaaatt ccagtacgac gggagataaa 60 

acctaacata cgccatggtg accgttatct aaactacgcc aaaatatttg aagtgtcgtc 120 

gtttcataat aaaacgcaaa caaaaaccca ctcccacttt ctcctttcca aaaaaagaac 180 

tctcgccact ttctctgctc ttttctttct ctctctcttt cttgttttcg ccggcgatc 239 

atg gag aaa age ggc ttc tct ccc gtc ggt eta agg gtt ctt gtc gta 287 
Met Glu Lys Ser Gly Phe Ser Pro Val Gly Leu Arg Val Leu Val Val 
15 10 15 

gac gat gat cca act tgg etc aag att etc gag aaa atg etc aag aag 335 
Asp Asp Asp Pro Thr Trp Leu Lys lie Leu Glu Lys Met Leu Lys Lys 
20 25 30 

tgt tct tac gaa gta acg ace tgt gga tta get aga gag get ttg agg 383 
Cys Ser Tyr Glu Val Thr Thr Cys Gly Leu Ala Arg Glu Ala Leu Arg 
35 40 45 

ttg ctg agg gag cgt aaa gat gga tat gat ate gtg ate age gat gtg 431 
Leu Leu Arg Glu Arg Lys Asp Gly Tyr Asp lie Val lie Ser Asp Val 
50 55 60 

aac atg cct gac atg gat ggt ttc aag ctt ctt gag cat gtt ggt ctt 479 
Asn Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu 
65 70 75 80 

gaa tta gac etc cct gta ata atg atg teg gtg gac ggc gaa aca age 527 
Glu Leu Asp Leu Pro Val He Met Met Ser Val Asp Gly Glu Thr Ser 
85 90 95 

cga gtg atg aag gga gtg cac acg gga get tgt gat tac etc ttg aag 575 
Arg Val Met Lys Gly Val His Thr Gly Ala Cys Asp Tyr Leu Leu Lys 
100 105 110 

ccg ata aga atg aag gag tta aag att ata tgg caa cat gtt ctg aga 623 
Pro He Arg Met Lys Glu Leu Lys lie He Trp Gin His Val Leu Arg 
115 120 125 

aag aag ctt caa gaa gtg aga gat ate gaa ggc tgt gga tac gaa gga 671 
Lys Lys Leu Gin Glu Val Arg Asp He Glu Gly Cys Gly Tyr Glu Gly 
130 135 140 

gga gcg gat tgg ate act cga tac gat gaa gca cat ttt ctt gga ggt 719 
Gly Ala Asp Trp He Thr Arg Tyr Asp Glu Ala His Phe Leu Gly Gly 
145 150 155 160 

ggt gaa gat gtt tct ttt ggg aaa aag aga aaa gac ttt gac ttt gag 767 
Gly Glu Asp Val Ser Phe Gly Lys Lys Arg Lys Asp Phe Asp Phe Glu 
165 170 175 
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aag aag ctt ctt caa gat gag age gat cca tea tct tct tct tec aag 
Lys Lys Leu Leu Gin Asp Glu Ser Asp Pro Ser Ser Ser Ser Ser Lys 
180 185 190 



815 



aaa get aga gtt gtt tgg tct ttt gag ctt cat cat aag ttt gtc aac 
Lys Ala Arg Val Val Trp Ser Phe Glu Leu His His Lys Phe Val Asn 
195 200 205 



863 



gec gtt aac caa ate gga tgc gat cac aaa get ggt ccc aag aag ata 
Ala Val Asn Gin lie Gly Cys Asp His Lys Ala Gly Pro Lys Lys lie 
210 215 220 



911 



ttg gat etc atg aat gtt cca tgg etc act aga gaa aat gtt gca age 
Leu Asp Leu Met Asn Val Pro Trp Leu Thr Arg Glu Asn Val Ala Ser 
225 230 235 240 



959 



cac ctt cag aaa tat aga ctt tac ctg age aga tta gag aaa gga aag 
His Leu Gin Lys Tyr' Arg Leu Tyr Leu Ser Arg Leu Glu Lys Gly Lys 
245 250 255 



1007 



gag etc aag tgt tat tea ggt ggc gtg aag aat gcg gat tea tct cca 
Glu Leu Lys Cys Tyr Ser Gly Gly Val Lys Asn Ala Asp Ser Ser Pro 
260 265 270 



1055 



aaa gat gtc gaa gtg aat tea ggc tac caa age cct ggg agg age age 
Lys Asp Val Glu Val Asn Ser Gly. Tyr Gin Ser Pro Gly Arg Ser Ser 
275 280 285 



1103 



tat gta ttc tct gga gga aat tct ctg ate caa aaa gca aca gag att 
Tyr Val Phe Ser Gly Gly Asn Ser Leu lie Gin Lys Ala Thr Glu lie 
290 295 300 



1151 



gat cca aag cca ctt get tea get tct ttg tct gac ccc aac ace gat 
Asp Pro Lys Pro Leu Ala Ser Ala Ser Leu Ser Asp Pro Asn Thr Asp 
305 310 315 320 



1199 



gtg ate atg cct ccg aaa aca aaa aag acg cgt ata gga ttt gat cct 
Val lie Met Pro Pro Lys Thr Lys Lys Thr Arg lie Gly Phe Asp Pro 
325 330 335 



1247 



ccc att tec tec tct gcg ttt gac tct ctg ctt cct tgg aat gat gtt 
Pro lie Ser Ser Ser Ala Phe Asp Ser Leu Leu Pro Trp Asn Asp Val 
340 345 350 



1295 



cca gag gtc ctt gaa teg aag ccg gtt ctg tat gag aat age ttt etc 
Pro Glu Val Leu Glu Ser Lys Pro Val Leu Tyr Glu Asn Ser Phe Leu 
355 360 365 



1343 



cag caa caa cca ttg cca agt caa agt tec tat gtt gca att tct gca 
Gin Gin Gin Pro Leu Pro Ser Gin Ser Ser Tyr Val Ala He Ser Ala 
370 375 380 



1391 



cca tct etc atg gag gag gaa atg aag cct cct tat gag aca cca gca 
Pro Ser Leu Met Glu Glu Glu Met Lys Pro Pro Tyr Glu Thr Pro Ala 
385 390 395 400 



1439 



99^ ggc agt agt gtg aat gca gat gag ttt etc atg cca caa gac aag 
Gly Gly Ser Ser Val Asn Ala Asp Glu Phe Leu Met Pro Gin Asp Lys 
405 410 415 



1487 



ate cct act gta ace ctt caa gat ttg gat ccc tct gec atg aag ctg 
He Pro Thr Val Thr Leu Gin Asp Leu Asp Pro Ser Ala Met Lys Leu 
420 425 430 



1535 



cag gag ttc aac aca gaa ggc gat tct gaa gaa get tga actggggaac 
Gin Glu Phe Asn Thr Glu Gly Asp Ser Glu Glu Ala 
435 440 



1584 



ttccagaatc 


acatcattct 


gtttctttag 


acactgactt 


agacttgact 


tggcttcaag 


1644 


gcgagcgttt 


ettgeaaaca 


ccgactccag 


tttcaagata 


cagtagtagc 


ccatcactcc 


1704 


tatctgagct 


cccagcccac 


cttaattggt 


atggaaatga 


gcggctgcct 


gaccctgacg 


1764 


agtattcctt 


catggtagac 


caaggtttat 


tcatatctta 


accttgttcc 


aataacttct 


1824 
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tttcgtatat tggttggtgt aatgcagaaa gattttgtgg gtatacctga aaataatctt 1884 

gctttcccaa gaaccttcca tgatcggatg cattgtacaa taatccacga gtgtcgtagg 1944 

ctaattacac caaacaggtt gatgacagtg ataaggccac atgtttcaca ccgtcgctta" 2004 

agatctttac tgtcacctgg aaggaaa 2031 

<210> 14 
<211> 444 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

Met Glu Lys .Ser Gly Phe Ser Pro Val Gly Leu Arg Val Leu Val Val 
15 10 15 

Asp Asp Asp Pro Thr Trp Leu Lys lie Leu Glu Lys Met Leu Lys Lys 
20 25 30 

Cys Ser Tyr Glu Val Thr Thr Cys Gly Leu Ala Arg Glu Ala Leu Arg 
35 40 45 

Leu Leu Arg Glu Arg Lys Asp Gly Tyr Asp lie Val lie Ser Asp Val 
50 55 60 

Asn Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu 
65 70 75 80 

Glu Leu Asp Leu Pro Val He Met Met Ser Val Asp Gly Glu Thr Ser 
85 90 95 

Arg Val Met Lys Gly Val His Thr Gly Ala Cys Asp Tyr Leu Leu Lys 
100 105 110 

Pro He Arg Met Lys Glu Leu Lys lie He Trp Gin His Val Leu Arg 
115 120 125 

Lys Lys Leu Gin Glu Val Arg Asp He Glu Gly Cys Gly Tyr Glu Gly 
13 0 13 5 140 

Gly Ala Asp Trp He Thr Arg Tyr Asp Glu Ala His Phe Leu Gly Gly 
145 150 155 160 

Gly Glu Asp Val Ser Phe Gly Lys Lys Arg Lys Asp Phe Asp Phe Glu 
165 170 175 

Lys Lys Leu Leu Gin Asp Glu Ser Asp Pro Ser Ser Ser Ser Ser Lys 
180 185 190 

Lys Ala Arg Val Val Trp Ser Phe Glu Leu His His Lys Phe Val Asn 
195 200 205 

Ala Val Asn Gin He Gly Cys Asp His Lys Ala Gly Pro Lys Lys He 
210 215 220 

Leu Asp Leu Met Asn Val Pro Trp Leu Thr Arg Glu Asn Val Ala Ser 
225 230 235 240 
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His Leu Gin Lys Tyr Arg Leu Tyr Leu Ser Arg Leu Glu Lys Gly Lys 
245 250 255 

Glu Leu Lys Cys Tyr Ser Gly Gly Val Lys Asn Ala Asp Ser Ser Pro 
260 265 270 

Lys Asp Val Glu Val Asn Ser Gly Tyr Gin Ser Pro Gly Arg Ser Ser 
275 280 285 

Tyr Val Phe Ser Gly Gly Asn Ser Leu lie Gin Lys Ala Thr Glu lie 
290 295_ 300 

Asp Pro- Lys Pro Leu Ala Ser Ala Ser Leu Ser Asp Pro Asn Thr Asp 
305 310 315 320 

Val lie Met Pro Pro Lys Thr Lys Lys Thr Arg lie Gly Phe Asp Pro 
325 330 335 

Pro lie Ser Ser Ser Ala Phe Asp Ser Leu Leu Pro Trp Asn Asp Val 
340 345 350 

Pro Glu Val Leu Glu Ser Lys Pro Val Leu Tyr Glu Asn Ser Phe Leu 
355 360 365 

Gin Gin Gin Pro Leu Pro Ser Gin Ser Ser Tyr Val Ala lie Ser Ala 
370 375 380 

Pro Ser Leu Met Glu Glu Glu Met Lys Pro Pro Tyr Glu Thr Pro Ala 
385 390 395 400 

Gly Gly Ser Ser Val Asn Ala Asp Glu Phe Leu Met Pro Gin Asp Lys 
405 410 415 

lie Pro Thr Val Thr Leu Gin Asp Leu Asp Pro Ser Ala Met Lys Leu 
420 425 430 

Gin Glu Phe Asn Thr Glu Gly Asp Ser Glu Glu Ala 

440 





435 


<210> 


15 


<211> 


2821 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(188) . . (2716) 


<223> 


G438 


<400> 


15 



cggggtaccc aagccacgac cgtagaatct tcttttgtct gaaaagaatt acaatttacg 60 

tttctcttac gatacgacgg actttccgaa gaaattaatt taaagagaaa agaagaagaa 120 

gccaaagaag aagaagaagc tagaagaaac agtaaagttt gagacttttt ttgagggtcg 180 

agctaaa atg gag atg gcg gtg get aac cac cgt gag aga age agt gac 229 
Met Glu Met Ala Val Ala Asn His Arg Glu Arg Ser Ser Asp 
1 5 .10 
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agt atg aat aga cat tta gat agt age ggt aag tac gtt agg tac aca 2 77 

Ser Met Asn Arg His Leu Asp Ser Ser Gly Lys Tyr Val Arg Tyr Thr 
15 20 25 30 

get gag caa gtc gag get ctt gag cgt gtc tac get gag tgt cct aag - 325 
Ala Glu Gin Val Glu Ala Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys 
35 40 45 

cct age tct etc cgt cga caa caa ttg ate cgt gaa tgt tec att ttg 3 73 

Pro Ser Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Ser lie Leu 
50 55 60 

gee aat att gag cct aag cag ate aaa gtc tgg ttt cag aac cgc agg 421 
Ala Asn lie Glu Pro Lys Gin lie Lys Val Trp Phe Gin Asn Arg Arg 
65 70 75 

tgt cga gat aag cag agg aaa gag gcg teg agg etc cag age gta aac 469 
Cys Arg Asp Lys Gin Arg Lys Glu Ala Ser Arg Leu Gin Ser Val Asn 
80 85 90 

egg aag etc tct gcg atg aat aaa ctg ttg atg gag gag aat gat agg 517 
Arg Lys Leu Ser Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg 
95 100 105 110 

ttg cag aag cag gtt tct cag ctt gtc tgc gaa aat gga tat atg aaa 565 
Leu Gin Lys Gin Val Ser Gin Leu Val Cys Glu Asn Gly Tyr Met Lys 
115 120 125 

cag cag eta act act gtt gtt aac gat cca age tgt gaa tct gtg gtc 613 
Gin Gin Leu Thr Thr Val Val Asn Asp Pro Ser Cys Glu Ser Val Val 
130 135 140 

aca act cct cag cat teg ctt aga gat gcg aat agt cct get gga ttg 661 
Thr Thr Pro Gin His Ser Leu Arg Asp Ala Asn Ser Pro Ala Gly Leu 
145 150 155 

etc tea ate gca gag gag act ttg gca gag ttc eta tec aag get aca 709 
Leu Ser lie Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr 
160 165 170 

gga act get gtt gat tgg gtt cag atg cct ggg atg aag cct ggt ccg 757 
Gly Thr Ala Val Asp Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro 
175 180 185 190 

gat teg gtt ggc ate ttt gee att teg caa aga tgc aat gga gtg gca 805 
Asp Ser Val Gly lie Phe Ala lie Ser Gin Arg Cys Asn Gly Val Ala 
195 200 205 

get cga gee tgt ggt ctt gtt age tta gaa cct atg aag att gca gag 853 
Ala Arg Ala Cys Gly Leu Val Ser Leu Glu Pro Met Lys lie Ala Glu 
210 215 220 

ate etc aaa gat egg cca tct tgg ttc cgt gac tgt agg age ctt gaa 901 
lie Leu LyB Asp Arg Pro Ser Trp Phe Arg Asp Cys Arg Ser Leu Glu 
225 230 235 

gtt ttc act atg ttc ccg get ggt aat ggt ggc aca ate gag ctt gtt 949 
Val Phe Thr Met Phe Pro Ala Gly Asn Gly Gly Thr He Glu Leu Val 
240 245 250 

tat atg cag acg tat gca cca acg act ctg get cct gee cgc gat ttc 997 
Tyr Met Gin Thr Tyr Ala Pro Thr Thr Leu Ala Pro Ala Arg Asp Phe 
255 260 265 270 

tgg ace ctg aga tac aca acg age etc gac aat ggg agt ttt gtg gtt 1045 
Trp Thr Leu Arg Tyr Thr Thr Ser Leu Asp Asn Gly Ser Phe Val Val 
275 280 285 

tgt gag agg teg eta tct ggc tct gga get ggg cct aat get get tea 1093 
Cys Glu Arg Ser Leu Ser Gly Ser Gly Ala Gly Pro Asn Ala Ala Ser 
290 295 300 

get tct cag ttt gtg aga gca gaa atg ctt tct agt ggg tat tta ata 1141 
Ala Ser Gin Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu He 
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305 310 315 

agg cct tgt gat ggt ggt ggt tct att att cac att gtc gat cac ctt 1189 
Arg Pro Cys Asp Gly Gly Gly Ser lie He His He Val Asp His Leu 
320 325 330 

aat ctt gag get tgg agt gtt ccg gat gtg ctt cga ccc ctt tat gag 1237 
Asn Leu Glu Ala Trp Ser Val Pro Asp Val Leu Arg Pro Leu Tyr Glu 
335 340 345 350 

tea tec aaa gtc gtt gca caa aaa atg acc att tec gcg ttg egg tat 1285 
Ser Ser Lys Val Val Ala Gin Lys Met Thr lie Ser Ala Leu Arg Tyr 
355 360 365 

ate agg caa tta gec caa gag tct aat ggt gaa gta gtg tat gga tta 1333 
He Arg Gin Leu Ala Gin Glu Ser Asn Gly Glu Val Val Tyr Gly Leu 
370 * 375 380 

gga agg cag cct get gtt ctt aga acc ttt age caa aga tta age agg 1381 
Gly Arg Gin Pro Ala Val Leu Arg Thr Phe Ser Gin Arg Leu Ser Arg 
385 390 395 

ggc ttc aat gat gcg gtt aat ggg ttt ggt gac gac ggg tgg tct acg 1429 
Gly Phe Asn Asp Ala Val Asn Gly Phe Gly Asp Asp Gly Trp Ser Thr 
400 405 410 

atg cat tgt gat gga gcg gaa gat att ate gtt get att aac tct aca 1477 
Met His Cys Asp Gly Ala Glu Asp He He Val Ala lie Asn Ser Thr 
415 420 425 430 

aag cat ttg aat aat att tct aat tct ctt teg ttc ctt gga ggc gtg 1525 
Lys His Leu Asn Asn He Ser Asn Ser Leu Ser Phe Leu Gly Gly Val 
435 440 445 

etc tgt gee aag get tea atg ctt etc caa aat gtt cct cct gcg gtt 1573 
Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 
450 455 460 

ttg ate egg ttc ctt aga gag cat cga tct gag tgg get gat ttc aat 1621 
Leu He Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Phe Asn 
465 470 475 

gtt gat gca tat tec get get aca ctt aaa get ggt age ttt get tat 1669 
Val Asp Ala Tyr Ser Ala Ala Thr Leu Lys Ala Gly Ser Phe Ala Tyr 
480 485 490 

ccg gga atg aga cca aca aga ttc act ggg agt cag ate ata atg cca 1717 
Pro Gly Met Arg Pro Thr Arg Phe Thr Gly Ser Gin lie He Met Pro 
495 500 505 510 

eta gga cat aca att gaa cac gaa gaa atg eta gaa gtt gtt aga ctg 1765 
Leu Gly His Thr He Glu His Glu Glu Met Leu Glu Val Val Arg Leu 
515 520 525 

gaa ggt cat tct ctt get caa gaa gat gca ttt atg tea egg gat gtc 1813 
Glu Gly His Ser Leu Ala Gin Glu Asp Ala Phe Met Ser Arg Asp Val 
530 535 540 



cat etc ctt cag att tgt acc ggg att gac gag aat gee gtt gga get 
His Leu Leu Gin He Cys Thr Gly He Asp Glu Asn Ala Val Gly Ala 
545 550 555 



acg gga gat gta caa gat ctg tta acc get aat cac cgt aca eta gac 
Thr Gly Asp Val Gin Asp Leu Leu Thr Ala Asn His Arg Thr Leu Asp 
595 600 605 



1861 



tgt tct gaa ctg ata ttt get ccg att aat gag atg ttc ccg gat gat 1909 
Cys Ser Glu Leu He Phe Ala Pro lie Asn Glu Met Phe Pro Asp Asp 
560 565 570 

get cca ctt gtt ccc tct gga ttc cga gtc ata ccc gtt gat get aaa 1957 
Ala Pro Leu Val Pro Ser Gly Phe Arg Val He Pro Val Asp Ala LyB 
575 580 585 590 



2005 



tta act tct age ctt gaa gtc ggt cca tea cct gag aat get tct gga 2053 
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Leu Thr Ser Ser Leu Glu Val Gly Pro Ser Pro Glu Asn Ala Ser Gly 
610 615 620 

aac tct ttt tct age tea age teg aga tgt att etc act ate gcg ttt 2101 
Asn Ser Phe Ser Ser Ser Ser Ser Arg Cys lie Leu Thr lie Ala Phe 
625 630 635 

caa ttc cct ttt gaa aac aac ttg caa gaa aat gtt get ggt atg get 2149 
Gin Phe Pro Phe Glu Asn Asn Leu Gin Glu Asn Val Ala Gly Met Ala 
640 645 650 

tgt cag tat gtg agg age gtg ate tea tea gtt caa cgt gtt gca atg 2197 
Cys Gin Tyr Val Arg Ser Val lie Ser Ser Val Gin Arg Val Ala Met 
655 660 665 670 

gcg ate tea ccg tct ggg ata age ccg agt ctg ggc tec aaa ttg tec 2245 
Ala lie Ser Pro Ser Gly lie" Ser Pro Ser Leu Gly Ser Lys Leu Ser 
' 675 680 685 

cca gga tct cct gaa get gtt act ctt get cag tgg ate tct caa agt 2293 
Pro Gly Ser Pro Glu Ala Val Thr Leu Ala Gin Trp lie Ser Gin Ser 
690 695 700 

tac agt cat cac tta ggc teg gag ttg ctg acg att gat tea ctt gga 2341 
Tyr Ser His His Leu Gly Ser Glu Leu Leu Thr lie Asp Ser Leu Gly 
705 710 715 

age gac gac teg gta eta aaa ctt eta tgg gat cac caa gat gee ate 23 89 

Ser Asp Asp Ser Val Leu Lys Leu Leu Trp Asp His Gin Asp Ala lie 
720 725 730 

ctg tgt tgc tea tta aag cca cag cca gtg ttc atg ttt gcg aac caa 243 7 

Leu Cys Cys Ser Leu Lys Pro Gin Pro Val Phe Met Phe Ala Asn Gin 
735 740 745 750 

get ggt eta gac atg eta gag aca aca ctt gta gee tta caa gat ata 2485 
Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp lie 

755 760 765 

aca etc gaa aag ata ttc gat gaa teg ggt cgt aag get ate tgt teg 2533 
Thr Leu Glu Lys lie Phe Asp Glu Ser Gly Arg Lys Ala He Cys Ser 
770 775 780 

gac ttc gee aag eta atg caa cag gga ttt get tgc ttg cct tea gga 2581 
Asp Phe Ala Lys Leu Met Gin Gin Gly Phe Ala Cys Leu Pro Ser Gly 
785 790 795 

ate tgt gtg tea acg atg gga aga cat gtg agt tat gaa caa get gtt 2629 
He Cys Val Ser Thr Met Gly Arg His Val Ser Tyr Glu Gin Ala Val 
800 805 810 

get tgg aaa gtg ttt get gca tct gaa gaa aac aac aac aat ctg cat 2677 
Ala Trp Lys Val Phe Ala Ala Ser Glu Glu Asn Asn Asn Asn Leu His 
815 820 825 830 

tgt ctt gee ttc tec ttt gta aac tgg tct ttt gtg tga ttcgattgac 2726 
Cys Leu Ala Phe Ser Phe Val Asn Trp Ser Phe Val 
835 840 

agaaaaagac taatttaaat ttacgttaga gaactcaaat ttttggttgt tgtttaggtg 2786 

tctctgtttt gttttttaaa attattttga tcaaa 2821 

<210> 16 
<211> 842 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

Met Glu Met Ala Val Ala Asn His Arg Glu Arg Ser Ser Asp Ser Met 
15 10 15 
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Asn Arg His Leu Asp Ser Ser Gly Lys Tyr Val Arg Tyr Thr Ala Glu 
20 25 30 

Gin Val Glu Ala Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser 
35 40 45 

Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Ser He Leu Ala Asn 
50 55 ' 60 

lie Glu Pro Lys Gin He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 75 80 

Asp Lys Gin Arg Lys Glu Ala Ser Arg Leu Gin Ser Val Asn Arg Lys 

: 85." 90 95 

Leu Ser Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

Lys Gin Val Ser Gin Leu Val Cys Glu Asn Gly Tyr Met Lys Gin Gin 
115 120 125 

Leu Thr Thr Val Val Asn Asp Pro Ser Cys Glu Ser Val Val Thr Thr 
130 135 140 

Pro Gin His Ser Leu Arg Asp Ala Asn Ser Pro Ala Gly Leu Leu Ser 
145 ISO 155 160 

He Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 
165 170 175 

Ala Val Asp Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180. 185 190 

Val Gly He Phe Ala He Ser Gin Arg Cys Asn Gly Val Ala Ala Arg 
195 200 205 

Ala Cys Gly Leu Val Ser Leu Glu Pro Met LyB He Ala Glu He Leu 
210 215 220 

Lys Asp Arg Pro Ser Trp Phe Arg Asp Cys Arg Ser Leu Glu Val Phe 
225 230 235 240 

Thr Met Phe Pro Ala Gly Asn Gly Gly Thr He Glu Leu Val Tyr Met 
245 250 255 

Gin Thr Tyr Ala Pro Thr Thr Leu Ala Pro Ala Arg Asp Phe Trp Thr 
260 265 270 

Leu Arg Tyr Thr Thr Ser Leu Asp Asn Gly Ser Phe Val Val Cys Glu 
275 280 285 

Arg Ser Leu Ser Gly Ser Gly Ala Gly Pro Asn Ala Ala Ser Ala Ser 
290 295 300 

Gin Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu He Arg Pro 
305 310 315 320 
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Cys Asp Gly Gly Gly Ser He He His He Val Asp His Leu Asn Leu 
325 330 335 

Glu Ala Trp Ser Val Pro Asp Val Leu Arg Pro Leu Tyr Glu Ser Ser - 
340 345 350 

Lys Val Val Ala Gin Lys Met Thr He Ser Ala Leu Arg Tyr He Arg 
355 360 365 

Gin Leu Ala Gin Glu Ser Asn Gly Glu Val Val Tyr Gly Leu Gly Arg 
370 375 380 

Gin Pro Ala Val Leu Arg Thr Phe Ser Gin Arg Leu Ser Arg Gly Phe 
385 390 395 400 

Asn Asp Ala Val Asn Gly Phe Gly Asp Asp Gly Trp Ser Thr Met His 
405 410 415 

Cys Asp Gly Ala Glu Asp He He Val Ala He Asn Ser Thr Lys His 
420 425 430 

Leu Asn Asn He Ser Asn Ser Leu Ser Phe Leu Gly Gly Val Leu Cys 
435 440 445 

Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val Leu He 
450 455 460 

Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Phe Asn Val Asp 
465 470 475 480 

Ala Tyr Ser Ala Ala Thr Leu Lys Ala Gly Ser Phe Ala Tyr Pro Gly 
485 490 495 

Met Arg Pro Thr Arg Phe Thr Gly Ser Gin He He Met Pro Leu Gly 
500 505 510 

His Thr He Glu His Glu Glu Met Leu Glu Val Val Arg Leu Glu Gly 
515 • 520 525 

His Ser Leu Ala Gin Glu Asp Ala Phe Met Ser Arg Asp Val His Leu 
530 535 540 

Leu Gin lie Cys Thr Gly He Asp Glu Asn Ala Val Gly Ala Cys Ser 
545 550 555 560 

Glu Leu He Phe Ala Pro He Asn Glu Met Phe Pro Asp Asp Ala Pro 
565 570 575 

Leu Val Pro Ser Gly Phe Arg Val He Pro Val Asp Ala Lys Thr Gly 
580 585 590 

Asp Val Gin Asp Leu Leu Thr Ala Asn His Arg Thr Leu Asp Leu Thr 
595 600 605 

Ser Ser Leu Glu Val Gly Pro Ser Pro Glu Asn Ala Ser Gly Asn Ser 
610 615 620 
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Phe Ser Ser Ser Ser Ser Arg Cys lie Leu Thr lie Ala Phe Gin Phe 
625 630 635 640 



Pro Phe Glu Asn Asn Leu Gin Glu Asn Val Ala Gly Met Ala Cys Gin 
645 650 655 

Tyr Val Arg Ser Val He Ser Ser Val Gin Arg Val Ala Met Ala He 
660 665 670 

Ser Pro Ser Gly He Ser Pro Ser Leu Gly Ser Lys Leu Ser Pro Gly 
675 680 685 



Ser Pro Glu Ala Val Thr Leu Ala Gin Trp He Ser Gin Ser Tyr Ser 
690 695 700 



His His Leu Gly Ser Glu Leu Leu Thr He Asp Ser Leu Gly Ser Asp 
705 710 715 720 



Asp Ser Val Leu Lys Leu Leu Trp Asp His Gin Asp Ala He Leu Cys 
725 730 735 



Cys Ser Leu Lys Pro Gin Pro Val Phe Met Phe Ala Asn Gin Ala Gly 
740 745 750 

Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Thr Leu 
755 760 765 

Glu Lys He Phe Asp Glu Ser Gly Arg Lys Ala He Cys Ser Asp Phe 
770 775 780 

Ala Lys Leu Met Gin Gin Gly Phe Ala Cys Leu Pro Ser Gly He Cys 
785 790 795 800 

Val Ser Thr Met Gly Arg His Val Ser Tyr Glu Gin Ala Val Ala Trp 
805 810 815 

Lys Val Phe Ala Ala Ser Glu Glu Asn Asn Asn Asn Leu His Cys Leu 
820 825 830 



Ala Phe Ser Phe Val Asn Trp Ser Phe Val 

840 





835 


<210> 


17 


<211> 


1888 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(326) . . (1708) 


<223> 


G571 


<400> 


17 



tagccgacct ctcttctctc ttctgaaaaa aacaccaaag gagctttaaa tgctccgtta 60 

cataatctct atctctttcc aagaatatag agaaaggaaa ataatataca agaattaaaa 120 

gaaggtatat catcatctct ctagctagtg atcaaagcac cgtcatcatc atcatatatc 180 
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atcagcttgc ctcagaggag aagaccaaca caagagagat cgaagatcaa aatctatctc 240 

tcttcatcat cttctgctgc tactatcata tcacacgctc tctcaaacat catcctatat 300 

atagacttct cttcatcatc atcaa atg caa ggt cat cac cag aat cat cat *" 352 

Met Gin Gly His His Gin Asn His His 

1 5 

caa cac tta tea tea tec tec gec acg tct tec cat gga aac ttc atg 400 

Gin His Leu Ser Ser Ser Ser Ala Thr Ser Ser His Gly Asn Phe Met 
10 15 20 25 

aac aaa gat ggg tat gat att gga gag ata gac cca tea etc ttc etc 448 
Asn Lys Asp Gly Tyr Asp lie Gly Glu lie Asp Pro Ser Leu Phe Leu 

30 35 40 

tat ctt gat gga caa gga cat cat gat cct cca tea act get cct tct 4 96 

Tyr Leu Asp Gly Gin Gly His His Asp Pro Pro Ser Thr Ala Pro Ser 
45 50 55 

cct tta cat cat cat cac aca act cag aat ttg gcg atg aga cct cca 544 
Pro Leu His His His His Thr Thr Gin Asn Leu Ala Met Arg Pro Pro 
60 65 70 

aca teg acg etc aac ate ttt cca tct cag cct atg cac ata gag cca 592 
Thr Ser Thr Leu Asn lie Phe Pro Ser Gin Pro Met His lie Glu Pro 
75 80 85 

cct cct tct tct aca cac aat acc gat aat aca aga tta gtt ccg get 640 
Pro Pro Ser Ser Thr His Asn Thr Asp Asn Thr Arg Leu Val Pro Ala 
90 95 100 105 

get caa cct agt ggt tec act cga cca get tct gac ccg tec atg gac 688 
Ala Gin Pro Ser Gly Ser Thr Arg Pro Ala Ser Asp Pro Ser Met Asp 

110 115 120 

ttg acc aat cat tct cag ttt cat caa cct cct caa ggt tct aaa tec 736 
Leu Thr Asn His Ser Gin Phe His Gin Pro Pro Gin Gly Ser Lys Ser 
125 130 135 

ate aag aag gaa ggg aac cgc aag ggt ctt gee tea teg gac cat gac 784 
lie Lys Lys Glu Gly Asn Arg Lys Gly Leu Ala Ser Ser Asp His Asp 
140 145 150 

ata cct aaa teg tea gac cct aaa aca ttg aga aga eta gca caa aac 832 
He Pro Lys Ser Ser Asp Pro Lys Thr Leu Arg Arg Leu Ala Gin Asn 
155 160 165 

aga gaa gca gca aga aaa age aga tta cgt aaa aag get tat gtt cag 880 
Arg Glu Ala Ala Arg Lys Ser Arg Leu Arg Lys Lys Ala Tyr Val Gin 
170 175 180 185 

caa etc gag tea tgt agg ate aaa ctg acc caa eta gaa caa gag att 928 
Gin Leu Glu Ser Cys Arg He Lys Leu Thr Gin Leu Glu Gin Glu He 

190 195 200 

caa egg gee aga tec caa ggc gta ttc ttt gga ggg tct ctt ata gga 976 
Gin Arg Ala Arg Ser Gin Gly Val Phe Phe Gly Gly Ser Leu He Gly 
205 210 215 

gga gat caa cag caa ggt gga eta ccc att ggc cct ggc aac ate age 1024 
Gly Asp Gin Gin Gin Gly Gly Leu Pro He Gly Pro Gly Asn He Ser 
220 225 230 

tct gaa gca gcg gtg ttc gat atg gaa tat gcg agg tgg ctg gag gag 1072 
Ser Glu Ala Ala Val Phe Asp Met Glu Tyr Ala Arg Trp Leu Glu Glu 
235 240 245 

cag cag agg eta tta aac gaa eta agg gtg gca aca caa gaa cac ttg 1120 
Gin Gin Arg Leu Leu Asn Glu Leu Arg Val Ala Thr Gin Glu His Leu 
250 255 260 265 

tec gag aac gag ctt agg atg ttt gtg gac aca tgt tta get cat tat 1168 
Ser Glu Asn Glu Leu Arg Met Phe Val Asp Thr Cys Leu Ala His Tyr 
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275 280 



gac cat ttg att aac etc aag get atg gtc get aag ace gat gtc ttc 
Asp His Leu lie Asn Leu Lys Ala Met Val Ala Lys Thr Asp Val Phe 
285 290 295 

cac etc att tct gga gca tgg aaa act cca get gaa cgt tgc ttc ttg 

His Leu lie Ser Gly Ala Trp Lys Thr Pro Ala Glu Arg Cys Phe Leu 
300 305 310 

tgg atg ggt ggt ttc cgt cca teg gag ate att aag gtg att gtg aac 

Trp Met Gly Gly Phe Arg Pro Ser Glu lie He Lys Val He Val Asn 
315 320 325 

cag ata gaa cca ttg acg gag caa cag ata gtt ggg ata tgt ggg ctg 

Gin He Glu Pro Leu Thr Glu Gin Gin He Val Gly He Cys Gly Leu 
330 335 . " 340 345 

caa cag tec aca caa gag gec gag gag get etc teg caa ggc etc gag 
Gin Gin Ser Thr Gin Glu Ala Glu Glu Ala Leu Ser Gin Gly Leu Glu 

350 355 360 

gcg ttg aat caa tea ctt tec gat age att gtc tct gac tec etc ccg 

Ala Leu Asn Gin Ser Leu Ser Asp Ser lie Val Ser Asp Ser Leu Pro 
365 370 375 

cct gec tec gca cca ctt cct cct cat eta tec aat ttc atg tea cac 

Pro Ala Ser Ala Pro Leu Pro Pro His Leu Ser Asn Phe Met Ser His 
380 385 390 

atg tec tta get etc aac aag etc tct get etc gag ggc ttc gtt etc 

Met Ser Leu Ala Leu Asn Lys Leu Ser Ala Leu Glu Gly Phe Val Leu 
395 400 405 

cag gcg gat aat ttg agg cac caa acg ate cat agg ctg aac caa ttg 

Gin Ala Asp Asn Leu Arg His Gin Thr lie His Arg Leu Asn Gin Leu 
410 415 420 425 

ttg acg ace cgt caa gaa gca egg tgt ctt eta gee gtt gcg gag tac 

Leu Thr Thr Arg Gin Glu Ala Arg Cys Leu Leu Ala Val Ala Glu Tyr 

430 435 440 

ttc cac cgt ctt caa get eta age tct etc tgg eta gec cgt cct egg 

Phe His Arg Leu Gin Ala Leu Ser Ser Leu Trp Leu Ala Arg Pro Arg 
445 450 455 

caa gat gga taa tactaaaaca actgatgaag gaaaccaaaa acaaaaacaa 
Gin Asp Gly 
460 

gagaataggt tgattagtta gccgccagct tgacctcttt atcatatata tcgtctctct 
actcaaatac agtgcaatta gggaaaattg tttggcttct ttttggtata tgattcttac 
tattatgttt ttaatcaaga 



1216 

1264 

1312 

1360 

1408 

1456 

1504 

1552 

1600 

1648 

1696 

1748 

1808 
1868 
1888 



<210> 18 
<211> 460 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Met Gin Gly His His Gin Asn His His Gin His Leu Ser Ser Ser Ser 
15 10 15 

Ala Thr Ser Ser His Gly Asn Phe Met Asn Lys Asp Gly Tyr Asp lie 
20 25 30 



Gly Glu He Asp Pro Ser Leu Phe Leu Tyr Leu Asp Gly Gin Gly His 
35 40 45 
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His Asp Pro Pro Ser Thr Ala Pro Ser Pro Leu His His His His Thr 
50 55 60 



Thr Gin Asn Leu Ala Met Arg Pro Pro Thr Ser Thr Leu Asn lie Phe 
65 70 75 80 



Pro Ser Gin Pro Met His lie Glu Pro Pro Pro Ser Ser Thr His Asn 
85 90 95 



Thr Asp Asn Thr Arg Leu Val Pro Ala Ala Gin Pro Ser Gly Ser Thr 
100 105 110 



Arg Pro Ala Ser Asp Pro Ser Met Asp Leu Thr Asn His Ser Gin Phe 
115 120 125 



His Gin Pro Pro Gin Gly Ser Lys Ser lie Lys Lys Glu Gly Asn Arg 
130 135 140 



Lys Gly Leu Ala Ser Ser Asp His Asp He Pro Lys Ser Ser Asp Pro 
145 150 155 160 



Lys Thr Leu Arg Arg Leu Ala Gin Asn Arg Glu Ala Ala Arg Lys Ser 
165 170 175 



Arg Leu Arg Lys Lys Ala Tyr Val Gin Gin Leu Glu Ser Cys Arg He 
180 185 190 



Lys Leu Thr Gin Leu Glu Gin Glu He Gin Arg Ala Arg Ser Gin Gly 
195 200 205 



Val Phe Phe Gly Gly Ser Leu He Gly Gly Asp Gin Gin Gin Gly Gly 
210 215 220 



Leu Pro He Gly Pro Gly Asn He Ser Ser Glu Ala Ala Val Phe Asp 
225 230 235 240 



Met Glu Tyr Ala Arg Trp Leu Glu Glu Gin Gin Arg Leu Leu Asn Glu 
245 250 255 



Leu Arg Val Ala Thr Gin Glu His Leu Ser Glu Asn Glu Leu Arg Met 
260 265 270 



Phe Val Asp Thr Cys Leu Ala His Tyr Asp His Leu He Asn Leu Lys 
275 280 285 



Ala Met Val Ala Lys Thr Asp Val Phe His Leu He Ser Gly Ala Trp 
290 295 300 



Lys Thr Pro Ala Glu Arg Cys Phe Leu Trp Met Gly Gly Phe Arg Pro 
305 310 315 320 

Ser Glu He He Lys Val He Val Asn Gin He Glu Pro Leu Thr Glu 
325 330 335 

Gin Gin He Val- Gly He Cys Gly Leu Gin Gin Ser Thr Gin Glu Ala 
340 345 350 
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Glu Glu Ala Leu Ser Gin Gly Leu Glu Ala Leu Asn Gin Ser Leu Ser 
355 360 365 

Asp Ser lie Val Ser Asp Ser Leu Pro Pro Ala Ser Ala Pro Leu Pro 
370 375 380 

Pro His Leu Ser Asn Phe Met Ser His Met Ser Leu Ala Leu Asn Lys 
385 390 395 400 

Leu Ser Ala Leu Glu Gly Phe Val Leu Gin Ala Asp Asn Leu Arg His 
40S 410 415 

Gin Thr lie His Arg Leu Asn Gin Leu Leu Thr Thr Arg Gin Glu Ala 
420 425 430 

Arg Cys Leu Leu Ala Val Ala Glu Tyr Phe His Arg Leu Gin Ala Leu 
435 440 445 

Ser Ser Leu Trp Leu Ala Arg Pro Arg Gin Asp Gly 
450 455 460 



<210> 


19 


<211> 


1707 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(98) . . (1444) 


<223> 


G748 


<400> 


19 



ccacgcgtcc gcactctccc aaatctctct tctttaacaa caaaaaaaaa atcacagaga 60 

catagagaga agaagacgga acagaggctc caaaaaa atg atg atg gag act aga 115 

Met Met Met Glu Thr Arg 
1 5 

gat cca get att aag ctt ttc ggt atg aaa ate cct ttt ccg teg gtt 163 
Asp Pro Ala lie Lys Leu Phe Gly Met Lys lie Pro Phe Pro Ser Val 
10 15 20 

ttt gaa teg gca gtt acg gtg gag gat gac gaa gaa gat gac tgg age 211 
Phe Glu Ser Ala Val Thr Val Glu Asp Asp Glu Glu Asp Asp Trp Ser 
25 30 35 

ggc gga gat gac aaa tea cca gag aag gta act cca gag tta tea gat 259 
Gly Gly Asp Asp Lys Ser Pro Glu Lys Val Thr Pro Glu Leu Ser Asp 
40 45 50 



aag aac aac aac aac tgt aac gac aac agt ttt aac aat teg aaa ccc 
Lys Asn Asn Asn Asn Cys Asn Asp Asn Ser Phe Asn Asn Ser Lys Pro 
55 60 65 70 



eta aag aaa ccg act aag att eta ccg tgt ccg aga tgc aaa age atg 
Leu Lys Lys Pro Thr Lys He Leu Pro Cys Pro Arg Cys Lys Ser Met 
105 110 115 



307 



gaa acc ttg gac aaa gag gaa gcg aca tea act gat cag ata gag agt 355 

Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser Thr Asp Gin He Glu Ser 
75 80 85 

agt gac acg cct gag gat aat cag cag acg aca cct gat ggt aaa acc 403 

Ser Asp Thr Pro Glu Asd Asn Gin Gin Thr Thr Pro Asp Gly Lys Thr 
90 * 95 100 



451 
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gag acc aag ttc tgt tat tac aac aac tac aac ata aac cag cct cgt 499 
Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr Asn lie Asn Gin Pro Arg 
120 125 130 

cat ttc tgc aag get tgt cag aga tat tgg act get gga ggg act atg 54 7 

His Phe Cys Lys Ala Cys Gin Arg Tyr Trp Thr Ala Gly Gly Thr Met 
135 140 145 150 

agg aat gtt cct gtg ggg gca gga cgt cgt aag aac aaa age tea tct 595 
Arg Asn Val Pro Val Gly Ala Gly Arg Arg Lys Asn Lys Ser Ser Ser 

155 160 165 

tct cat tac cgt cac ate act att tec gag get ctt gag get gcg agg 643 
Ser His Tyr Arg His lie Thr lie Ser Glu Ala Leu Glu Ala Ala Arg 
170 175 180 

ctt gac ccg ggc tta" cag gca aac aca agg gtc ttg agt ttt ggt etc 691 
Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg Val Leu Ser Phe Gly Leu 
185 190 195 

gaa get cag cag cag cac gtt get get ccc atg aca cct gtt atg aag 73 9 

Glu Ala Gin Gin Gin His Val Ala Ala Pro Met Thr Pro Val Met Lys 
200 205 210 

eta caa gaa gat caa aag gtc tea aac ggt get agg aac agg ttt cac 787 
Leu Gin Glu Asp Gin Lys Val Ser Asn Gly Ala Arg Asn Arg Phe His 
215 220 225 230 

ggg tta gcg gat caa egg ctt gta get egg gta gag aat gga gat gat 835 
Gly Leu Ala Asp Gin Arg Leu Val Ala Arg Val Glu Asn Gly Asp Asp 
235 240 245 

tgc tea age gga tec tct gtg acc acc tct aac aat cac tea gtg gat 883 
Cys Ser Ser Gly Ser Ser Val Thr Thr Ser Asn Asn His Ser Val Asp 
250 255 260 

gaa tea aga gca caa age ggc agt gtt gtt gaa gca caa atg aac aac 931 
Glu Ser Arg Ala Gin Ser Gly Ser Val Val Glu Ala Gin Met Asn Asn 
265 270 275 

aac aac aac aat aac atg aat ggt tat get tgc ate cca ggt gtt cca 979 
Asn Asn Asn Asn Asn Met Asn Gly Tyr Ala Cys lie Pro Gly Val Pro 
280 285 290 

tgg cct tac acg tgg aat cca gcg atg cct cca cca ggt ttt tac ccg 1027 
Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro Pro Pro Gly Phe Tyr Pro 
295 300 305 310 

cct cca ggg tat cca atg ccg ttt tac cct tac tgg acc ate cca atg 1075 
Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro Tyr Trp Thr lie Pro Met 
315 320 325 

eta cca ccg cat caa tec tea teg cct ata age caa aag tgt tea aat 1123 
Leu Pro Pro His Gin Ser Ser Ser Pro lie Ser Gin Lys Cys Ser Asn 
330 335 340 

aca aac tct ccg act etc gga aag cat ccg aga gat gaa gga tea teg 1171 
Thr Asn Ser Pro Thr Leu Gly Lys His Pro Arg Asp Glu Gly Ser Ser 
345 350 355 

aaa aag gac aat gag aca gag cga aaa cag aag gee ggg tgc gtt ctg 1219 
Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin Lys Ala Gly Cys Val Leu 
360 365 370 

gtc ccg aaa acg ttg aga ata gat gat cct aac gaa gca gca aag age 1267 
Val Pro Lys Thr Leu Arg lie Asp Asp Pro Asn Glu Ala Ala Lys Ser 
375 380 385 390 

teg ata tgg aca aca ttg gga ate aag aac gag gcg atg tgc aaa gee 1315 
Ser lie Trp Thr Thr Leu Gly lie Lys Asn Glu Ala Met Cys Lys Ala 
395 400 405 

ggt ggt atg ttc aaa ggg ttt gat cat aag aca aag atg tat aac aac 1363 
Gly Gly Met Phe Lys Gly Phe Asp His Lys Thr Lys Met Tyr Asn Asn 
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410 



MBI0018 Sequence 
415 



gac aaa get gag aac tec cct gtt ctt tct get 
Asp Lys Ala Glu Asn Ser Pro Val Leu Ser Ala 
425 430 

tea aga tea cac aat ttc cat gaa cag att tag 
Ser Arg Ser His Asn Phe His Glu Gin lie 
440 445 

atatgtatga ttgattgtat gtatagatga tactggagaa 

aactcttttc ttctttctag tgattgeett tattccttta 

actatttgat ttaccttttt tactttcttt cttcatttgt 

cattaatggt aaaaagttgg tgtggaccgt tgttgcgttg 

aaa 



Listing .ST25 
420 

aac cct get get eta 
Asn Pro Ala Ala Leu 
435 



tgatgagttt ttgagaatca 
catgttttgg ttctctgtac 
caggaaatgt tggaagataa 
gcatttcaaa aaaaaaaaaa 



1411 



agttacatat gtatatgtat 1464 



1524 
1584 
1644 
1704 
1707 



<210> 20 
<211> 448 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 20 

Met Met Met Glu Thr Arg Asp Pro Ala lie Lys Leu Phe Gly Met Lys 
15 10 15 



lie Pro Phe Pro Ser Val Phe Glu Ser Ala Val Thr Val Glu Asp Asp 
20 25 30 



Glu Glu Asp Asp Trp Ser Gly Gly Asp Asp Lys Ser Pro Glu Lys Val 
35 40 45 



Thr Pro Glu Leu Ser Asp Lys Asn Asn Asn Asn Cys Asn Asp Asn Ser 
50 55 60 



Phe Asn Asn Ser Lys Pro Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser 
65 70 75 80 



Thr Asp Gin lie Glu Ser Ser Asp Thr Pro Glu Asp Asn Gin Gin Thr 
85 90 95 



Thr Pro Asp Gly Lys Thr Leu Lys Lys Pro Thr Lys lie Leu Pro Cys 
100 105 110 



Pro Arg Cys Lys Ser Met Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr 
115 120 125 



Asn lie Asn Gin Pro Arg His Phe Cys Lys Ala Cys Gin Arg Tyr Trp 
130 135 140 



Thr Ala Gly Gly Thr Met Arg Asn Val Pro Val Gly Ala Gly Arg Arg 

145 150 155 160 

Lys Asn Lys Ser Ser Ser Ser His Tyr Arg His He Thr He Ser Glu 

165 170 175 



Ala Leu Glu Ala Ala Arg Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg 
180 185 190 
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Val Leu Ser Phe Gly Leu Glu Ala Gin Gin Gin His Val Ala Ala Pro 
195 200 205 

Met Thr Pro Val Met Lys Leu Gin Glu Asp Gin Lys Val Ser Asn Gly 
210 215 220 

Ala Arg Asn Arg Phe His Gly Leu Ala Asp Gin Arg Leu Val Ala Arg 
225 230 235 240 

Val Glu Asn Gly Asp Asp Cys Ser Ser Gly Ser Ser Val Thr Thr Ser 
245 250 255 

Asn Asn His Ser Val- Asp Glu Ser Arg Ala Gin Ser Gly Ser Val Val 
260 265 270 



Glu Ala Gin Met Asn Asn Asn Asn Asn Asn Asn Met Asn Gly Tyr Ala 
275 280 285 



Cys lie Pro Gly Val Pro Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro 
290 295 300 



Pro Pro Gly Phe Tyr Pro Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro 
305 310 315 320 

Tyr Trp Thr lie Pro Met Leu Pro Pro His Gin Ser Ser Ser Pro lie 
325 330 335 



Ser Gin Lys Cys Ser Asn Thr Asn Ser Pro Thr Leu Gly Lys His Pro 

340 345 350 

Arg Asp Glu Gly Ser Ser Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin 

355 360 365 



Lys Ala Gly Cys Val Leu Val Pro Lys Thr Leu Arg lie Asp Asp Pro 
370 375 380 



Asn Glu Ala Ala Lys Ser Ser lie Trp Thr Thr Leu Gly lie Lys Asn 
385 390 395 400 



Glu Ala Met Cys Lys Ala Gly Gly Met Phe Lys Gly Phe Asp His Lys 
405 410 415 



Thr Lys Met Tyr Asn Asn Asp Lys Ala Glu Asn Ser Pro Val Leu Ser 
420 425 430 



Ala Asn Pro Ala Ala Leu Ser Arg Ser His Asn Phe His Glu Gin lie 
435 440 445 



<210> 21 

<211> 1149 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1) . . (1149) 

<223> G431 
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<400> 21 

atg gag agt ggt tec aac age act tct tgt cca atg get ttt gec ggg 4 8 

Met Glu Ser Gly Ser Aen Ser Thr Ser Cys Pro Met Ala Phe Ala Gly 
15 10 15 

gat aat agt gat ggt ccg atg tgt cct atg atg atg atg atg ccg ccc 96 
Asp Asn Ser Asp Gly Pro Met Cys Pro Met Met Met Met Met Pro Pro 
20 25 30 

ate atg aca tea cat caa cat cat ggt cat gat cat caa cat caa caa 144 
He Met Thr Ser His Gin His His Gly His Asp His Gin His Gin Gin 
35 40 45 

caa gaa cat gat ggt tat gca tat cag tea cac cac caa caa agt agt 192 
Gin Glu His Asp Gly Tyr Ala Tyr Gin Ser His His Gin Gin Ser Ser 
50 55 - 60 

tec ctt ttt ctt caa tea eta get cct ccc caa gga act aag aac aaa 240 
Ser Leu Phe Leu Gin Ser Leu Ala Pro Pro Gin Gly Thr Lys Asn Lys 
65 70 75 80 

gtt get tct tct tct tct cct tec tct tgt get cct gee tat tct eta 288 
Val Ala Ser Ser Ser Ser Pro Ser Ser Cys Ala Pro Ala Tyr Ser Leu 
85 90 95 

atg gag ate cat cat aac gaa ate gtt gca gga gga ate aac cct tgc 336 
Met Glu He His His Asn Glu He Val Ala Gly Gly He Asn Pro Cys 
100 105 110 

tec tct ttc tct tct tea gee tct gtc aag gee aag ate atg get cat 384 
Ser Ser Phe Ser Ser Ser Ala Ser Val Lys Ala Lys He Met Ala His 
115 120 125 

cct cac tac cac cgc etc ttg gec get tat gtc aat tgt cag aag gtt 432 
Pro His Tyr His Arg Leu Leu Ala Ala Tyr Val Asn Cys Gin Lys Val 
130 135 140 

gga gca cca ccg gag gtt gtg gcg agg ctg gag gag gca tgc teg tct 480 
Gly Ala Pro Pro Glu Val Val Ala Arg Leu Glu Glu Ala Cys Ser Ser 
145 150 155 160 

gee gca gec gca gec gca tct atg ggg cca aca ggg tgt ctt ggt gaa 528 
Ala Ala Ala Ala Ala Ala Ser Met Gly Pro Thr Gly Cys Leu Gly Glu 
165 170 175 

gat cca ggg ctt gat caa ttc atg gaa get tac tgt gaa atg etc gtt 576 
Asp Pro Gly Leu Asp Gin Phe Met Glu Ala Tyr Cys Glu Met Leu Val 
180 185 190 

aag tat gag caa gag etc tec aaa cct ttc aag gaa get atg gtc ttc 624 
Lys Tyr Glu Gin Glu Leu Ser Lys Pro Phe Lys Glu Ala Met Val Phe 
195 200 205 

ctt caa cgt gtc gag tgt caa ttc aaa tec etc tct eta tec tea cct 672 
Leu Gin Arg Val Glu Cys Gin Phe Lys Ser Leu Ser Leu Ser Ser Pro 
210 215 220 

tec tct ttc tec ggt tat gga gag aca gca att gat agg aac aat aat 720 
Ser Ser Phe Ser Gly Tyr Gly Glu Thr Ala lie Asp Arg Asn Aen Asn 
225 230 235 240 

ggg tea tec gag gaa gaa gtc gat atg aac aat gaa ttt gta gat cca 768 
Gly Ser Ser Glu Glu Glu Val Asp Met Asn Asn Glu Phe Val Asp Pro 
245 250 255 

caa get gag gat aga gag ctt aaa gga cag etc ttg cgc aag tac agt 816 
Gin Ala Glu Asp Arg Glu Leu Lys Gly Gin Leu Leu Arg Lys Tyr Ser 
260 265 270 

ggt tac tta ggg age etc aag caa gag ttc atg aag aag agg aag aaa 864 
Gly Tyr Leu Gly Ser Leu Lys Gin Glu Phe Met Lys Lys Arg Lys Lys 
275 280 285 

gga aag etc cct aaa gaa get cgt caa caa ctg ctt gat tgg tgg age 912 
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Gly Lys Leu Pro Lys Glu Ala Arg Gin Gin Leu Leu Asp Trp Trp Ser 
290 295 300 

cgt cac tac aaa tgg cct tac cct teg gag caa caa aag etc gec ctt 960 
Arg His Tyr Lys Trp Pro Tyr Pro Ser Glu Gin Gin Lys Leu Ala Leu 
305 310 315 320 v , 

gcg gaa tea acg ggg ctg gac cag aaa cag ata aac aat tgg ttc ata 1008 
Ala Glu Ser Thr Gly Leu Asp Gin Lys Gin lie Asn Asn Trp Phe lie 
325 330 335 

aac cag agg aaa egg cat tgg aag ccg teg gag gac atg cag ttt gta 1056 
Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asp Met Gin Phe Val 
340 345 350 

gta atg gac gca aca cat cct cac cat tac ttc atg gat aat gtc ttg 1104 
Val Met Asp Ala Thr His Pro His His Tyr Phe Met Asp Asn Val Leu 

355 - . .. 360 365 

gac aat cct ttc cca atg gat cac ate tec tec ace atg ctt tga 1149 
Asp Asn Pro Phe Pro Met Asp His lie Ser Ser Thr Met Leu 
370 375 . 380 

<210> 22 
<211> 382 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 22 

Met Glu Ser Gly Ser Asn Ser Thr Ser Cys Pro Met Ala Phe Ala Gly 
15 10 15 

Asp Asn Ser Asp Gly Pro Met Cys Pro Met Met Met Met Met Pro Pro 
20 25 30 

He Met Thr Ser His Gin His His Gly Hib Asp His Gin His Gin Gin 
35 40 45 

Gin Glu His Asp Gly Tyr Ala Tyr Gin Ser His His Gin Gin Ser Ser 
50 55 60 

Ser Leu Phe Leu Gin Ser Leu Ala Pro Pro Gin Gly Thr Lys Asn Lys 
65 70 75 80 

Val Ala Ser Ser Ser Ser Pro Ser Ser Cys Ala Pro Ala Tyr Ser Leu 

85 90 95 

Met Glu He His His Asn Glu He Val Ala Gly Gly He Asn Pro Cys 
100 105 110 

Ser Ser Phe Ser Ser Ser Ala Ser Val Lys Ala Lys He Met Ala His 
115 120 125 

Pro His Tyr His Arg Leu Leu Ala Ala Tyr Val Asn Cys Gin Lys Val 
130 135 140 

Gly Ala Pro Pro Glu Val Val Ala Arg Leu Glu Glu Ala Cys Ser Ser 
145' 150 155 160 

Ala Ala Ala Ala Ala Ala Ser Met Gly Pro Thr Gly Cys Leu Gly Glu 
165 170 175 
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Asp Pro Gly Leu Asp Gin Phe Met Glu Ala Tyr Cys Glu Met Leu Val 
180 185 190 

Lys Tyr Glu Gin Glu Leu Ser Lys Pro Phe Lys Glu Ala Met Val Phe 
195 200 205 

Leu Gin Arg Val Glu Cys Gin Phe Lys Ser Leu Ser Leu Ser Ser Pro 
210 215 220 

Ser Ser Phe Ser Gly Tyr Gly Glu Thr Ala lie Asp Arg Asn Asn Asn 
225 230 235 240 

Gly Ser Ser Glu Glu Glu Val' Asp Met Asn Asn Glu Phe Val Asp Pro 
- 24* ■ 250 255 



Gin Ala Glu Asp Arg Glu Leu Lys Gly Gin Leu Leu Arg Lys Tyr Ser 
260 265 270 

Gly Tyr Leu Gly Ser Leu Lys Gin Glu Phe Met Lys Lys Arg Lys Lys 
275 280 285 

Gly Lys Leu Pro Lys Glu Ala Arg Gin Gin Leu Leu Asp Trp Trp Ser 
290 295 300 

Arg His Tyr Lys Trp Pro Tyr Pro Ser Glu Gin Gin Lys Leu Ala Leu 
305 310 315 320 

Ala Glu Ser Thr Gly Leu Asp Gin Lys Gin lie Asn Asn Trp Phe lie 
325 330 335 

Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asp Met Gin Phe Val 
340 345 350 

Val Met Asp Ala Thr His Pro His His Tyr Phe Met Asp Asn Val Leu 
355 360 365 



Asp Asn Pro Phe Pro Met Asp His He Ser Ser Thr Met Leu 
370 375 380 



<210> 


23 


<211> 


1136 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(118) . . (1074) 


<223> 


G187 


<400> 


23 



tagacctctt aggaaaaaaa cctaaaaacc taatccccaa acctaaaagg cttatctcat 

ctcttcttct ttgtcttctt tactcttttt ttacctctct cttcattgtt cttcacc 

atg tct aat gaa acc aga gat etc tac aac tac caa tac cct tea teg 
Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gin Tyr Pro Ser Ser 
15 10 15 

ttt teg ttg cac gaa atg atg aat ctg cct act tea aat cca tct tct 
Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser Asn Pro Ser Ser 
20 25 30 



60 
117 
165 

213 
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tat gga aac etc cca tea caa aac ggt ttt aat cca tct act tat tec 
Tyr Gly Asn Leu Pro Ser Gin Asn Gly Phe Asn Pro Ser Thr Tyr Ser 
35 40 45 



261 



ttc acc gat tgt etc caa agt tct cca gca gcg tat gaa tct eta ctt 
Phe Thr Asp Cys Leu Gin Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 
50 55 60 



309 



cag aaa act ttt ggt ctt tct ccc tct tec tea gag gtt ttc aat tct 
Gin Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 
65 70 75 80 



357 



teg ate gat caa gaa ccg aac cgt gat gtt act aat gac gta ate aat 
Ser lie Asp Gin Glu Pro Asn Arg Asp Val Thr Asn Asp Val lie Asn 
85 90 95 



405 



ggt ggt gca tgc aac gag act gaa act agg gtt tct cct tct aat tct 
Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 
100 105 110 



453 



tec tct agt gag get gat cac ccc ggt gaa gat tec ggt aag age egg 
Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 
115 120 125 



501 



agg aaa cga gag tta gtc ggt gaa gaa gat caa att tec aaa aaa gtt 54 9 

Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gin lie Ser Lys Lys Val 
130 135 140 

ggg aaa acg aaa aag act gag gtg aag aaa caa aga gag cca cga gtc 597 
Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gin Arg Glu Pro Arg Val 
145 150 155 160 

teg ttt atg act aaa agt gaa gtt gat cat ctt gaa gat ggt tat aga 645 
Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 
165 170 175 



t99 a 9 a aaa tac ggc caa aag get gta aaa aat age cct tat cca agg 
Trp Arg Lys Tyr Gly Gin Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 
180 185 190 



693 



agt tac tat aga tgt aca aca caa aag tgc aac gtg aag aaa cga gtg 

Ser Tyr Tyr Arg Cys Thr Thr Gin Lys Cys Asn Val Lys Lys Arg Val 
195 200 205 

gag aga teg ttc caa gat cca acg gtt gtg att aca act tac gag ggt 

Glu Arg Ser Phe Gin Asp Pro Thr Val Val lie Thr Thr Tyr Glu Gly 
210 215 220 



741 



789 



caa cac aac cac ccg att ccg act aat ctt cga gga agt tct gee gcg 
Gin Hie Asn His Pro lie Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala 
225 230 235 240 



837 



get get atg ttc tec gca gac etc atg act cca aga age ttt gca cat 
Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 
245 250 255 



885 



gat atg ttt agg acg gca get tat act aac ggc 
Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly 
260 265 



ggt tct gtg gcg gcg 
Gly Ser Val Ala Ala 
270 



933 



get ttg gat tat gga tat gga caa agt ggt tat 
Ala Leu Asp Tyr Gly Tyr Gly Gin Ser Gly Tyr 
275 280 

aac cct agt tct cac caa gtg tat cat caa ggg 
Asn Pro Ser Ser His Gin Val Tyr His Gin Gly 
290 295 



ggt agt gtg aat tea 
Gly Ser Val Asn Ser 
285 

ggt gag tat gag etc 
Gly Glu Tyr Glu Leu 
300 



981 



1029 



ttg agg gag att ttt cct tea att ttc ttt aag 
Leu Arg Glu lie Phe Pro Ser lie Phe Phe Lys 
305 310 315 



caa gag cct tga 
Gin Glu Pro 



1074 



tcgatcattg ttataactac atatattata tatattgaga gagagaggta gagaaaaaaa 1134 
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aa 1136 

<210> 24 
<211> 318 
<212> PRT 

<213> Arabidopsis tha liana 
<400> 24 

Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gin Tyr Pro Ser Ser 
15 10 15 

Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser Asn Pro Ser Ser 
20 25 30 

Tyr Gly Asn Leu Pro Ser Gin Asn Gly Phe Asn Pro Ser Thr Tyr Ser 
35 40 45 

Phe Thr Asp Cys Leu Gin Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 
50 55 60 

Gin Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 
65 70 75 80 

Ser lie Asp Gin Glu Pro Asn Arg Asp Val Thr Asn Asp Val lie Asn 
85 90 95 

Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 
100 105 110 

Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 
115 120 125 

Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gin lie Ser Lys Lys Val 
130 135 140 

Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gin Arg Glu Pro Arg Val 
145 150 155 160 

Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 
165 170 175 

Trp Arg Lys Tyr Gly Gin Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 
180 185 190 

Ser Tyr Tyr Arg Cys Thr Thr Gin Lys Cys Asn Val Lys Lys Arg Val 
195 200 205 

Glu Arg Ser Phe Gin Asp Pro Thr Val Val He Thr Thr Tyr Glu Gly 
210 215 220 

Gin His Asn His Pro He Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala 
225 230 235 240 

Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 
245 250 255 



Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 
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260 265 270 

Ala Leu Asp Tyr Gly Tyr Gly Gin Ser Gly Tyr Gly Ser Val Asn Ser 
275 280 285 

Asn Pro Ser Ser His Gin Val Tyr His Gin Gly Gly Glu Tyr Glu Leu 
290 295 300 

Leu Arg Glu lie Phe Pro Ser He Phe Phe Lys Gin Glu Pro 
305 310 315 



<210> 


25 


<211> 


2580 


<212> 


DNA , 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2580) 


<223> 


G470 


<400> 


25 



atg gcg agt teg gag gtt tea atg aaa ggt aat cgt gga gga gat aac 48 
Met Ala Ser Ser Glu Val Ser Met Lys Gly Asn Arg Gly Gly Asp Asn 
1 5 10 15 

ttc tec tec tct ggt ttt agt gac cct aag gag act aga aat gtc tec 96 
Phe Ser Ser Ser Gly Phe Ser Asp Pro Lys Glu Thr Arg Asn Val Ser 
20 25 30 

gtc gec ggc gag ggg caa aaa agt aat tct ace cga tec get gcg get 144 
Val Ala Gly Glu Gly Gin Lys Ser Asn Ser Thr Arg Ser Ala Ala Ala 
35 40 45 

gag cgt get ttg gac cct gag get get ctt tac aga gag eta tgg cac 192 
Glu Arg Ala Leu Asp Pro Glu Ala Ala Leu Tyr Arg Glu Leu Trp His 
50 55 60 

get tgt get ggt ccg ctt gtg acg gtt cct aga caa gac gac cga gtc 240 
Ala Cys Ala Gly Pro Leu Val Thr Val Pro Arg Gin Asp Asp Arg Val 
65 70 75 80 

ttc tat ttt cct caa gga cac ate gag cag gtg gag get teg acg aac 288 
Phe Tyr Phe Pro Gin Gly His He Glu Gin Val Glu Ala Ser Thr Asn 
85 90 95 

cag gcg gca gaa caa cag atg cct etc tat gat ctt ccg tea aag ctt 336 
Gin Ala Ala Glu Gin Gin Met Pro Leu Tyr Asp Leu Pro Ser Lys Leu 
100 105 110 



etc tgt cga gtt att aat gta gat tta aag gca gag gca gat aca gat 
Leu Cys Arg Val He Asn Val Asp Leu Lys Ala Glu Ala Asp Thr Asp 
115 120 125 



384 



gaa gtt tat gcg cag att act ctt ctt cct gag get aat caa gac gag 432 
Glu Val Tyr Ala Gin He Thr Leu Leu Pro Glu Ala Asn Gin Asp Glu 
130 135 140 

aat gca att gag aaa gaa gcg cct ctt cct cca cct ccg agg ttc cag 480 
Asn Ala He Glu Lys Glu Ala Pro Leu Pro Pro Pro Pro Arg Phe Gin 
145 150 155 160 

gtg cat teg ttc tgc aaa acc ttg act gca tec gac aca agt aca cat 528 
Val His Ser Phe Cys Lys Thr Leu Thr Ala Ser Asp Thr Ser Thr His 
165 170 175 

ggt gga ttt tct gtt ctt agg cga cat gcg gat gaa tgt etc cca cct 576 
Gly Gly Phe Ser Val Leu Arg Arg His Ala Asp Glu Cys Leu Pro Pro 
180 185 190 
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ctg gat atg tct cga cag cct ccc act caa gag tta gtt gca aag gat 
Leu Asp Met Ser Arg Gin Pro Pro Thr Gin Glu Leu Val Ala Lys Asp 
195 200 205 



624 



ttg cat gca aat gag tgg cga ttc aga cat ata ttc egg ggt caa cca 
Leu His Ala Asn Glu Trp Arg Phe Arg His lie Phe Arg Gly Gin Pro 
210 215 220 

egg agg cat ttg eta cag agt ggg tgg agt gtg ttt gtt age tec aaa 
Arg Arg His Leu Leu Gin Ser Gly Trp Ser Val Phe Val Ser Ser Lys 
225 230 235 240 

agg eta gtt gca ggc gat gcg ttt ata ttt eta agg ggc gag aat gga 
Arg Leu Val Ala Gly Asp Ala Phe lie Phe Leu Arg Gly Glu Asn Gly 
245 250 255 

gaa tta aga gtt ggt gta agg" cgt gcg atg cga caa caa gga aac gtg 
Glu Leu Arg Val Gly Val Arg Arg Ala Met Arg Gin Gin Gly Asn Val 
260 265 270 

ccg tct tct gtt ata tct age cat age atg cat ctt gga gta ctg gec 
Pro Ser Ser Val lie Ser Ser His Ser Met His Leu Gly Val Leu Ala 
275 280 285 

ace gca tgg cat gec att tea aca ggg act atg ttt aca gtc tac tac 
Thr Ala Trp His Ala He Ser Thr Gly Thr Met Phe Thr Val Tyr Tyr 
290 295 300 

aaa ccc agg acg age cca tct gag ttt att gtt ccg ttc gat cag tat 
Lys Pro Arg Thr Ser Pro Ser Glu Phe He Val Pro Phe Asp Gin Tyr 
305 310 315 320 

atg gag tct gtt aag aat aac tac tct att ggc atg aga ttc aaa atg 
Met Glu Ser Val Lys Asn Asn Tyr Ser He Gly Met Arg Phe Lys Met 
325 330 335 

aga ttt gaa ggc gaa' gag get cct gag cag agg ttt act ggc aca ate 
Arg Phe Glu Gly Glu Glu Ala Pro Glu Gin Arg Phe Thr Gly Thr He 
340 345 350 

gtt ggg att gaa gag tct gat cct act agg tgg cca aaa tea aag tgg 
Val Gly He Glu Glu Ser Asp Pro Thr Arg Trp Pro Lys Ser Lys Trp 
355 360 365 

aga tec etc aag gtg aga tgg gat gag act tct agt att cct cga cct 
Arg Ser Leu Lys Val Arg Trp Asp Glu Thr Ser Ser He Pro Arg Pro 
370 375 380 

gat aga gta tct ccg tgg aaa gta gag cca get ctt get cct cct get 
Asp Arg val Ser Pro Trp Lys Val Glu Pro Ala Leu Ala Pro Pro Ala 
385 390 395 400 

ttg agt cct gtt cca atg cct agg cct aag agg ccc aga tea aat ata 
Leu Ser Pro Val Pro Met Pro Arg Pro Lys Arg Pro Arg Ser Asn He 
405 410 415 

gca cct tea tct cct gac tct teg atg ctt acc aga gaa ggt aca act 
Ala Pro Ser Ser Pro Asp Ser Ser Met Leu Thr Arg Glu Gly Thr Thr 
420 425 430 

aag gca aac atg gac cct tta cca gca age gga ctt tea agg gtc ttg 
Lys Ala Asn Met Asp Pro Leu Pro Ala Ser Gly Leu Ser Arg Val Leu 
435 440 445 

caa ggt caa gaa tac teg acc ttg agg acg aaa cat act gag agt gta 
Gin Gly Gin Glu Tyr Ser Thr Leu Arg Thr Lys His Thr Glu Ser Val 
450 455 460 

gag tgt gat get cct gag aat tct gtt gtc tgg caa tct tea gcg gat 
Glu Cya Asp Ala Pro Glu Asn Ser Val Val Trp Gin Ser Ser Ala Asp 
465 470 475 480 

gat gat aag gtt gac gtg gtt teg ggt tct aga aga tat gga tct gag 
Asp Asp Lys Val Asp Val Val Ser Gly Ser Arg Arg Tyr Gly Ser Glu 
485 490 495 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 
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aac tgg atg tec tea gec agg cat gaa cct act tac aca gat ttg etc 
Asn Trp Met Ser Ser Ala Arg His Glu Pro Thr Tyr Thr Asp Leu Leu 
500 505 510 



1536 



tec ggc ttt ggg act aac ata gat cca tec cat ggt cag egg ata cct 
Ser Gly Phe Gly Thr Asn lie Asp Pro Ser His Gly Gin Arg lie Pro 
515 520 525 



1584 



ttt tat gac cat tea tea tea cct tct atg cct gca aag aga ate ttg 
Phe Tyr Asp His Ser Ser Ser Pro Ser Met Pro Ala Lys Arg lie Leu 
530 535 540 



1632 



agt gat tea gaa ggc aag ttc gat tat ctt get aac cag tgg cag atg 
Ser Asp Ser Glu Gly Lys Phe Asp Tyr Leu Ala Asn Gin Trp Gin Met 
545 550 555 560 



1680 



ata cac tct ggt etc' tec ctg aag tta cat gaa tct cct aag gta cct 
He His Ser Gly Leu Ser Leu Lys Leu His Glu Ser Pro Lys Val Pro 
565 570 575 



1728 



gca gca act gat gcg tct etc caa ggg cga tgc aat gtt aaa tac age 
Ala Ala Thr Asp Ala Ser Leu Gin Gly Arg Cys Asn Val Lys Tyr Ser 
580 585 590 



1776 



gaa tat cct gtt ctt aat ggt eta teg act gag aat get ggt ggt aac 
Glu Tyr Pro Val Leu Asn Gly Leu Ser Thr Glu Asn Ala Gly Gly Asn 
595 600 605 



1824 



tgg cca ata cgt cca cgt get ttg aat tat tat gag gaa gtg gtc aat 
Trp Pro lie Arg Pro Arg Ala Leu Asn Tyr Tyr Glu Glu Val Val Asn 
610 615 620 



1872 



get caa gcg caa get cag get agg gag caa gta aca aaa caa ecc ttc 
Ala Gin Ala Gin Ala Gin Ala Arg Glu Gin Val Thr Lys Gin Pro Phe 
625 630 635 640 



1920 



acg ata caa gag gag aca gca aag tea aga gaa ggg aac tgc agg etc 
Thr He Gin Glu Glu Thr Ala Lys Ser Arg Glu Gly Asn Cys Arg Leu 
645 650 655 



1968 



ttt ggc att cct ctg ace aac aac atg aat ggg aca gac tea acc atg 
Phe Gly He Pro Leu Thr Asn Asn Met Asn Gly Thr Asp Ser Thr Met 
660 665 670 



2016 



tct cag aga aac aac ttg aat gat get gcg ggg ctt aca cag ata gca 
Ser Gin Arg Asn Asn Leu Asn Asp Ala Ala Gly Leu Thr Gin He Ala 
675 680 685 



2064 



tea cca aag gtt cag gac ctt tea gat cag tea aaa ggg tea aaa tea 
Ser Pro Lys Val Gin Asp Leu Ser Asp Gin Ser Lys Gly Ser Lys Ser 
690 695 700 



2112 



aca aac gat cat cgt gaa cag gga aga cca ttc cag act aat aat cct 
Thr Asn Asp His Arg Glu Gin Gly Arg Pro Phe Gin Thr Asn Asn Pro 
705 710 715 720 



2160 



cat ccg aag gat get caa acg aaa acc aac tea agt agg agt tgc aca 
His Pro Lys Asp Ala Gin Thr Lys Thr Asn Ser Ser Arg Ser Cys Thr 
725 730 735 



2208 



aag gtt cac aag cag gga att gca ctt ggc cgt tea gtg gat ctt tea 
Lys Val His Lys Gin Gly He Ala Leu Gly Arg Ser Val Asp Leu Ser 
740 745 750 



2256 



aag ttc caa aac tat gag gag tta gtc get gag ctg gac agg ctg ttt 
Lys Phe Gin Asn Tyr Glu Glu Leu Val Ala Glu Leu Asp Arg Leu Phe 
755 760 765 



2304 



gag ttc aat gga gag ttg atg get cct aag aaa gat tgg ttg ata gtt 
Glu Phe Asn Gly Glu Leu Met Ala Pro Lys Lys Asp Trp Leu He Val 
770 775 780 



2352 



tac aca gat gaa gag aat gat atg atg ctt gtt ggt gac gat cct tgg 
Tyr Thr Asp Glu Glu Asn Asp Met Met Leu Val Gly Asp Asp Pro Trp 
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785 790 795 800 

cag gag ttt tgt tgc atg gtt cgc aaa ate ttc ata tac acg aaa gag 2448 
Gin Glu Phe Cys Cys Met Val Arg Lye lie Phe lie Tyr Thr Lys Glu 
805 810 815 

gaa gtg agg aag atg aac ccg ggg act tta age tgt agg age gag gaa 24 96 
Glu Val Arg Lys Met Asn Pro Gly Thr Leu Ser Cys Arg Ser Glu Glu 
820 825 830 

gaa gca gtt gtt ggg gaa gga tea gat gca aag gac gee aag tct gca 2544 
Glu Ala Val Val Gly Glu Gly Ser Asp Ala Lys Asp Ala Lys Ser Ala 
835 840 845 

tea aat cct tea ttg tec age get ggg aac tct taa 2580 
Ser Asn Pro Ser Leu Ser Ser Ala Gly Asn Ser 
850 85S 

<210> 26 
<211> 859 
<212> PRT ' 

<213> Arabidopsis thaliana 
<400> 26 

Met Ala Ser Ser Glu Val Ser Met Lys Gly Asn Arg Gly Gly Asp Asn 
15 10 15 

Phe Ser Ser Ser Gly Phe Ser Asp Pro Lys Glu Thr Arg Asn Val Ser 
20 25 30 

Val Ala Gly Glu Gly Gin Lys Ser Asn Ser Thr Arg Ser Ala Ala Ala 
35 40 45 

Glu Arg Ala Leu Asp Pro Glu Ala Ala Leu Tyr Arg Glu Leu Trp His 
50 55 60 

Ala Cys Ala Gly Pro Leu Val Thr Val Pro Arg Gin Asp Asp Arg Val 
65 70 75 80 

Phe Tyr Phe Pro Gin Gly His He Glu Gin Val Glu Ala Ser Thr Asn 
85 90 95 

Gin Ala Ala Glu Gin Gin Met Pro Leu Tyr Asp Leu Pro Ser Lys Leu 
100 105 110 

Leu Cys Arg Val He Asn Val Asp Leu Lys Ala Glu Ala Asp Thr Asp 
115 120 125 

Glu Val Tyr Ala Gin lie Thr Leu Leu Pro Glu Ala Asn Gin Asp Glu 
130 135 140 

Asn Ala He Glu Lys Glu Ala Pro Leu Pro Pro Pro Pro Arg Phe Gin 
145 150 155 160 

Val His Ser Phe Cys Lye Thr Leu Thr Ala Ser Asp Thr Ser Thr His 
165 170 175 

Gly Gly Phe Ser Val Leu Arg Arg His Ala Asp Glu Cys Leu Pro Pro 
180 185 190 



Leu Asp Met Ser Arg Gin Pro Pro Thr Gin Glu Leu Val Ala Lys Asp 
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195 200 205 

Leu His Ala Asn Glu Trp Arg Phe Arg His He Phe Arg Gly Gin Pro 
210 215 220 

Arg Arg His Leu Leu Gin Ser Gly Trp Ser Val Phe Val Ser Ser Lys 
225 230 235 240 

Arg Leu Val Ala Gly Asp Ala Phe He Phe Leu Arg Gly Glu Asn Gly 
245 250 255 

Glu Leu Arg Val Gly Val Arg Arg Ala Met Arg Gin Gin Gly Asn Val 
260 265 270 

Pro Ser Ser Val He Ser Ser His Ser Met His Leu Gly Val Leu Ala 
275 280 285 

Thr Ala Trp His Ala He Ser Thr Gly Thr Met Phe Thr Val Tyr Tyr 
290 295 300 

Lys Pro Arg Thr Ser Pro Ser Glu Phe He Val Pro Phe Asp Gin Tyr 
305 310 315 320 

Met Glu Ser Val Lys Asn Asn Tyr Ser He Gly Met Arg Phe Lys Met 
325 330 335 

Arg Phe Glu Gly Glu Glu Ala Pro Glu Gin Arg Phe Thr Gly Thr He 
340 345 350 

Val Gly He Glu Glu Ser Asp Pro Thr Arg Trp Pro Lys Ser Lys Trp 
355 360 365 

Arg Ser Leu Lys Val Arg Trp Asp Glu Thr Ser Ser He Pro Arg Pro . 
370 375 380 

Asp Arg Val Ser Pro Trp Lys Val Glu Pro Ala Leu Ala Pro Pro Ala 
385 390 395 400 

Leu Ser Pro Val Pro Met Pro Arg Pro Lys Arg Pro Arg Ser Asn He 
405 410 415 

Ala Pro Ser Ser Pro Asp Ser Ser Met Leu Thr Arg Glu Gly Thr Thr 
420 425 430 

Lys Ala Asn Met Asp Pro Leu Pro Ala Ser Gly Leu Ser Arg Val Leu 
435 440 445 

Gin Gly Gin Glu Tyr Ser Thr Leu Arg Thr Lys His Thr Glu Ser Val 
450 455 460 

Glu Cys Asp Ala Pro Glu Asn Ser Val Val Trp Gin Ser Ser Ala Asp 
465 470 475 480 

Asp Asp Lys Val Asp Val Val Ser Gly Ser Arg Arg Tyr Gly Ser Glu 
485 490 495 
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Asn Trp Met Ser Ser Ala Arg His Glu Pro Thr Tyr Thr Asp Leu Leu 
500 505 510 

Ser Gly Phe Gly Thr Asn lie Asp Pro Ser His Gly Gin Arg lie Pro 
515 520 525 

Phe Tyr Asp His Ser Ser Ser Pro Ser Met Pro Ala Lys Arg lie Leu 
530 535 540 

Ser Asp Ser Glu Gly Lys Phe Asp Tyr Leu Ala Asn Gin Trp Gin Met 
545 550 555 560 

lie His Ser Gly Leu Ser Leu Lys Leu His Glu Ser Pro Lys Val Pro' 
- 565 570 575 

Ala Ala Thr Asp Ala Ser Leu Gin Gly Arg Cys Asn Val Lys Tyr Ser 
580 585 590 

Glu Tyr Pro Val Leu Asn Gly Leu Ser Thr Glu Asn Ala Gly Gly Asn 
595 600 605 

Trp Pro lie Arg Pro Arg Ala Leu Asn Tyr Tyr Glu Glu Val Val Asn 
610 615 620 

Ala Gin Ala Gin Ala Gin Ala Arg Glu Gin Val Thr Lys Gin Pro Phe 
625 630 635 640 

Thr lie Gin Glu Glu Thr Ala Lys Ser Arg Glu Gly Asn Cys Arg Leu . 

645 650 655 

Phe Gly lie Pro Leu Thr Asn Asn Met Asn Gly Thr Asp Ser Thr Met 
660 665 670 

Ser Gin Arg Asn Asn Leu Asn Asp Ala Ala Gly Leu Thr Gin lie Ala 
675 680 685 

Ser Pro Lys Val Gin Asp Leu Ser Asp Gin Ser Lys Gly Ser Lys Ser 
690 695 700 

Thr Asn Asp His Arg Glu Gin Gly Arg Pro Phe Gin Thr Asn Asn Pro 
705 710 715 720 

His Pro Lys Asp Ala Gin Thr Lys Thr Asn Ser Ser Arg Ser Cys Thr 
725 730 735 

Lys Val His Lys Gin Gly lie Ala Leu Gly Arg Ser Val Asp Leu Ser 
740 745 750 

Lys Phe Gin Asn Tyr Glu Glu Leu val Ala Glu Leu Asp Arg Leu Phe 
755 760 765 

Glu Phe Asn Gly Glu Leu Met Ala Pro Lys Lys Asp Trp Leu lie Val 
770 775 780 

Tyr Thr Asp Glu Glu Asn Asp Met Met Leu Val Gly Asp Asp Pro Trp 
785 790 795 800 
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Gin Glu Phe Cys Cys Met Val Arg Lys lie Phe lie Tyr Thr Lys Glu 
805 810 815 

Glu Val Arg Lys Met Asn Pro Gly Thr Leu Ser Cys Arg Ser Glu Glu ^ 
820 825 830 

Glu Ala Val Val Gly Glu Gly Ser Asp Ala Lys Asp Ala Lys Ser Ala 
835 840 845 

Ser Asn Pro Ser Leu Ser Ser Ala Gly Asn Ser 
850 855 



<210> 


27 . 


<211> 


1519 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(197) . . (1252) 


<223> 


G615 


<400> 


27 



ttttttcttt tctttctttt tttgctggtg tgagaaattg tacgcttact atctctctct 60 

ctctctgcca gattctctct ttttgatgat gtgaaagttg tgcttttgtt tcttaagaaa 120 

aaggcatatt tttaatactt gattcttggt tcttgattct tgattcttgg ttttttttag 180 

cttcttaagt tcggtg atg teg tct tec ace aat gac tac aac gat ggt aat 232 

Met Ser Ser Ser Thr Asn Asp Tyr Asn Asp Gly Asn 
15 10 

aac aat gga gtg tac cct etc tct ctt tac ctt tct tea etc tct ggc 280 
Asn Asn Gly Val Tyr Pro Leu Ser Leu Tyr Leu Ser Ser Leu Ser Gly 
15 20 25 

cat caa gac ate att cat aat ccc tac aac cat cag tta aaa gca tct 328 
His Gin Asp lie lie His Asn Pro Tyr Asn His Gin Leu Lys Ala Ser 
30 35 40 

ccg ggc cat atg gta tea gca gtt cct gaa tct ctg ate gat tac atg 3 76 

Pro Gly His Met Val Ser Ala Val Pro Glu Ser Leu He Asp Tyr Met 
45 50 55 60 

gcg ttt aag tea aat aat gtt gtg aat caa caa ggc ttt gag ttt cct 424 
Ala Phe Lys Ser Asn Asn Val Val Asn Gin Gin Gly Phe Glu Phe Pro 
65 70 75 

gag gtg tea aag gaa ate aag aag gtg gtg aag aag gac cga cat age 4 72 

Glu Val Ser Lys Glu He Lys Lys Val Val Lys Lys Asp Arg His Ser 
80 85 90 

aag att caa acg gca caa ggg att aga gac agg agg gtt agg ctt ttt 520 
Lys He Gin Thr Ala Gin Gly He Arg Asp Arg Arg Val Arg Leu Phe 
95 100 105 

att ggg att get cgc caa ttc ttt gat ctt cag gat atg ttg ggg ttt 568 
He Gly He Ala Arg Gin Phe Phe Asp Leu Gin Asp Met Leu Gly Phe 
110 115 120 

gat aaa get agt aaa acg tta gac tgg ctg etc aag aag tea aga aaa 616 
Asp Lys Ala Ser Lys Thr Leu Asp Trp Leu Leu Lys Lys Ser Arg Lys 
125 130 135 140 

gec ate aaa gag gtc gta caa gca aaa aac etc aac aat gat gat gaa 664 
Ala lie Lys Glu Val Val Gin Ala Lys Asn Leu Asn Asn Asp Asp Glu 
145 150 155 
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gat ttt gga aac att gga ggc gat gta gaa caa gaa gag gag aag gag 
Asp Phe Gly Asn lie Gly Gly Asp Val Glu Gin Glu Glu Glu Lys Glu 
160 165 170 

gag gat gac aat ggc gat aag age ttc gtg tat ggt ttg age ccc ggg 
Glu Asp Asp Asn Gly Asp Lys Ser Phe Val Tyr Gly Leu Ser Pro Gly *■ 
175 180 185 

tac ggt gaa gaa gaa gtg gta tgt gag gec acg aag gca ggg ata aga 
Tyr Gly Glu Glu Glu Val Val Cys Glu Ala Thr Lys Ala Gly He Arg 
190 195 200 

aag aag aag agt gag ttg aga aac ate tea tea aag ggg eta gga gee 
Lys Lys Lys Ser Glu Leu Arg Asn He Ser Ser Lys Gly Leu Gly Ala 
205 210 215 220 

aaa get aga gga aaa gca aag gag cga aca aaa gag atg atg gee tat 
Lys Ala Arg Gly Lys Ala Lys Glu Arg Thr Lys Glu Met Met Ala Tyr 
225 230 235 

gat aat cca gag act gec tct gat att aca caa tct gaa ate atg gac 
Asp Asn Pro Glu Thr Ala Ser Asp He Thr Gin Ser Glu He Met Asp 
240 245 250 

cca ttc aag agg tct ata gtc ttc aat gaa gga gaa gat atg aca cac 
Pro Phe Lys Arg Ser He Val Phe Asn Glu Gly Glu Asp Met Thr His 
255 260 265 

ctt ttc tac aag gaa cca ate gag gag ttt gat aat caa gaa tct ate 
Leu Phe Tyr Lys Glu Pro He Glu Glu Phe Asp Asn Gin Glu Ser He 
270 275 280 

tta acc aat atg act eta cca acg aag atg ggt caa agt tac aat caa 
Leu Thr Asn Met Thr Leu Pro Thr Lys Met Gly Gin Ser Tyr Asn Gin 
285 290 295 300 

aat aat ggg ata ctt atg ttg gta gat cag agt tct age age aac tat 
Asn Asn Gly He Leu Met Leu Val Asp Gin Ser Ser Ser Ser Asn Tyr 
305 310 315 

aat aca ttt ctg cct caa aat ttg gat tat agt tat gat caa aac cct 
Asn Thr Phe Leu Pro Gin Asn Leu Asp Tyr Ser Tyr Asp Gin Asn Pro 
320 325 330 

ttt cat gac caa acc tta tat gta gtc acc gac aaa aat ttc ccc aaa 
Phe HiB Asp Gin Thr Leu Tyr Val Val Thr Asp Lys Asn Phe Pro Lys 
335 340 345 

ggt ttc eta taa atctcgacag ttttgaagga etatgeatga tcaagtttaa 
Gly Phe Leu 
350 

acatgtaagc caatatagtc ccttattcct ctgaatgtat acaaaatcta tagttatgta 
tatctgttcc tttttaacgt atctttattg atcttctgtg ccttgatcaa aattgtcatt 
ttaagattca gtttgtgtaa tattttagct acaactttta agtggtatta ttgtaacctt 
ttgaactata tattttgaag atgaataaga acatgtttat ataaaaa 



712 

760 

808 

856 

904 

952 

1000 

1048 

1096 

1144 

1192 

1240 

1292 

1352 
1412 
14 72 
1519 



<210> 28 
<211> 351 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 28 

Met Ser Ser Ser Thr Asn Asp Tyr ABn Asp Gly Asn Asn Asn Gly Val 
15 10 15 



Tyr Pro Leu Ser Leu Tyr Leu Ser Ser Leu Ser Gly His Gin Asp ' He 
20 25 30 
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He His Asn Pro Tyr Asn His Gin Leu Lys Ala Ser Pro Gly His Met 
35 40 45 

Val Ser Ala Val Pro Glu Ser Leu He Asp Tyr Met Ala Phe Lys Ser 
50 55 60 

Asn Asn val Val Asn Gin Gin Gly Phe Glu Phe Pro Glu Val Ser Lys 
65 70 75 80 

Glu He Lys Lys Val Val Lys Lys Asp Arg His Ser Lys He Gin Thr 
85 90 95 

Ala Gin Gly lie Arg Asp. Arg Arg Val Arg Leu Phe He Gly He Ala 
100 105 110 



Arg Gin Phe Phe Asp Leu Gin Asp Met Leu Gly Phe Asp Lys Ala Ser 
115 120 125 

Lys Thr Leu Asp Trp Leu Leu Lys Lys Ser Arg Lys Ala He Lys Glu 
130 135 140 

Val Val Gin Ala Lys Asn Leu Asn Asn Asp Asp Glu Asp Phe Gly Asn 
145 150 .155 160 

He Gly Gly Asp Val Glu Gin Glu Glu Glu Lys Glu Glu Asp Asp Asn 
165 170 175 

Gly Asp Lys Ser Phe Val Tyr Gly Leu Ser Pro Gly Tyr Gly Glu Glu 
180 185 190 

Glu Val Val Cys Glu Ala Thr Lys Ala Gly He Arg Lys Lys Lys Ser 
195 200 205 

Glu Leu Arg Asn He Ser Ser Lys Gly Leu Gly Ala Lys Ala Arg Gly 
210 215 220 

Lys Ala Lys Glu Arg Thr Lys Glu Met Met Ala Tyr Asp Asn Pro Glu 
225 230 235 240 

Thr Ala Ser Asp He Thr Gin Ser Glu He Met Asp Pro Phe Lys Arg 
245 250 255 

Ser He Val Phe Asn Glu Gly Glu Asp Met Thr His Leu Phe Tyr Lys 
260 265 270 

Glu Pro He Glu Glu Phe Asp Asn Gin Glu Ser He Leu Thr Asn Met 
275 280 285 



Thr Leu Pro Thr Lys Met Gly Gin Ser Tyr Asn Gin Asn Asn Gly He 
290 295 300 



Leu Met Leu Val Asp Gin Ser Ser Ser Ser Asn Tyr Asn Thr Phe Leu 
305 310 315 320 

Pro Gin Asn Leu Asp Tyr Ser Tyr Asp Gin Asn Pro Phe His Asp Gin 
325 330 335 
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Thr Leu Tyr Val Val Thr Asp Lys Asn Phe Pro Lys Gly Phe Leu 
340 345 350 



<210> 


29 


<211> 


974 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


<G2) . . (874) 


<223> 


G1073 


<400> 


29 



ccccccgacc tgcctctaca gagacctgaa gattccagaa ccccacctga tcaaaaataa 60 

c atg gaa ctt aac aga tct gaa gca gac gaa gca aag gcc gag acc act 109 
Met Glu Leu Asn Arg Ser Glu Ala Asp Glu.Ala Lys Ala Glu Thr Thr 
15 10 15 

ccc acc ggt gga gcc acc age tea gcc aca gcc tct ggc tct tec tec 157 
Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

gga cgt cgt cca cgt ggt cgt cct gca ggt tec aaa aac aaa ccc aaa 205 

Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

cct ccg acg att ata act aga gat agt cct aac gtc ctt aga tea cac 253 

Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

gtt ctt gaa gtc acc tec ggt teg gac ata tec gag gca gtc tec acc 301 

Val Leu Glu Val Thr Ser Gly Ser Asp He Ser Glu Ala Val Ser Thr 
65 70 75 80 

tac gcc act cgt cgc ggc tgc ggc gtt tgc att ata age ggc acg ggt 349 
Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys He He Ser Gly Thr Gly 

85 90 95 

gcg gtc act aac gtc acg ata egg caa cct gcg get ccg get ggt gga 397 

Ala Val Thr Asn Val Thr He Arg Gin Pro Ala Ala Pro Ala Gly Gly 
100 105 110 

ggt gtg att acc ctg cat ggt egg ttt gac att ttg tct ttg acc ggt 445 

Gly Val He Thr Leu His Gly Arg Phe Asp He Leu Ser Leu Thr Gly 
115 . 120 125 

act gcg ctt cca ccg cct gca cca ccg gga gca gga ggt ttg acg gtg 493 

Thr Ala Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 

tat eta gcc gga ggt caa gga caa gtt gta gga ggg aat gtg get ggt 541 

Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

teg tta att get teg gga ccg gta gtg ttg atg get get tct ttt gca 589 

Ser Leu He Ala Ser Gly Pro Val Val Leu Met Ala Ala Ser Phe Ala 

165 170 175 

aac gca gtt tat gat agg tta ccg att gaa gag gaa gaa acc cca ccg 637 

Asn Ala Val Tyr Asp Arg Leu Pro He Glu Glu Glu Glu Thr Pro Pro 
180 185 190 

ccg aga acc acc ggg gtg cag cag cag cag ccg gag gcg tct cag teg 685 

Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 

teg gag gtt acg ggg agt ggg gcc cag gcg tgt gag tea aac etc caa 733 

Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 220 
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99 c 99 a aat ggt gga gga ggt gtt get ttc tac aat ctt gga atg aat 781 
Gly Gly Asn Gly Gly Gly Gly val Ala Phe Tyr Aen Leu Gly Met Aan 
225 230 235 240 

atg aac aat ttt caa ttc tec ggg gga gat att tac ggt atg age ggc 829 
Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 

245 250 255 

ggt age gga gga ggt ggt ggc ggt gcg act aga ccc gcg ttt tag 874 
Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 

agttttagcg ttttggtgac accttttgtt gcgtttgcgt gtttgacctc aaactactag 934 

gctactagct atagcggttg egaaatgega atattaggtt 974 

<210> 30 
<211> 270 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 30 

Met Glu Leu Asn Arg Ser Glu Ala Asp Glu Ala Lys Ala Glu Thr Thr 
15 10 15 

Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

val Leu Glu Val Thr Ser Gly Ser Asp He Ser Glu Ala Val Ser Thr 
65 70 75 80 

Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys He He Ser Gly Thr Gly 

85 90 95 

Ala Val Thr Asn Val Thr He Arg Gin Pro Ala Ala Pro Ala Gly Gly 
100 105 110 

Gly Val lie Thr Leu His Gly Arg Phe Asp He Leu Ser Leu Thr Gly 
115 120 125 

Thr Ala Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 

Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

Ser Leu lie Ala Ser Gly Pro Val Val Leu Met Ala Ala Ser Phe Ala 
165 170 175 

Asn Ala Val Tyr Asp Arg Leu Pro lie Glu Glu Glu Glu Thr Pro Pro 
180 185 190 

Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 
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Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 220 



Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Asn Leu Gly Met Asn 
225 230 235 240 



Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 

245 250 255 



Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 



<210> 


31 


<211> 


2010 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2010) 


<223> 


G1493 


<400> 


31 



atg atg aat ccg agt cac gga aga gga etc gga teg get ggt ggg tec 
Met Met Asn Pro Ser His Gly Arg Gly Leu Gly Ser Ala Gly Gly Ser 
15 10 15 

age tec ggt aga aat caa gga ggt ggt ggt gag acc gtc gtc gag atg 
Ser Ser Gly Arg Asn Gin Gly Gly Gly Gly Glu Thr Val Val Glu Met 
20 25 30 

ttt cct tct ggt ctt cga gtt ctt gtc gtt gac gat gac cca act tgt 
Phe Pro Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys 
35 40 45 

etc atg ate tta gag agg atg ctt agg act tgt ctt tac gaa gta acg 
Leu Met lie Leu Glu Arg Met Leu Arg Thr Cys Leu Tyr Glu Val Thr 
50 55 60 

aaa tgc aac aga gca gag atg gca ttg tct ctg etc egg aag aac aaa 
Lys Cys Asn Arg Ala Glu Met Ala Leu Ser Leu Leu Arg Lys Asn Lys 
65 70 75 80 

cat gga ttc gat ata gta ate agt gat gtt cat atg cct gac atg gac 
His Gly Phe Asp He Val He Ser Asp Val His Met Pro Asp Met Asp 
85 90 95 

ggt ttc aag ctt ctt gag cat gtt ggt eta gag atg gac tta cct gtt 
Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val 
100 105 110 

ate atg atg tct gcg gat gat tea aag agt gtg gtt eta aag gga gta 
He Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val 
115 120 125 

acg cac ggt gcg gtt gat tac ctt ate aag cct gta cgt atg gag gca 
Thr His Gly Ala Val Asp Tyr Leu lie Lys Pro Val Arg Met Glu Ala 
130 135 140 

ctt aag aac ata tgg cag cat gta gtt agg aag agg aga agt gaa tgg 
Leu Lys Asn He Trp Gin His Val Val Arg Lys Arg Arg Ser Glu Trp 
145 150 155 160 

agt gta ccg gaa cat tct ggg age att gag gag act ggc gag aga cag 
Ser Val Pro Glu His Ser Gly Ser He Glu Glu Thr Gly Glu Arg Gin 
165 170 175 

cag cag caa cat aga gga ggt ggt ggt ggt gca get gtt tct ggt gga 
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Gin Gin Gin His Arg Gly Gly Gly Gly Gly Ala Ala Val Ser Gly Gly 
180 185 190 

gag gat gcg gtg gat gat aac tea tec teg gtt aac gaa ggt aac aat 624 
Glu Asp Ala Val Asp Asp Asn Ser Ser Ser Val Asn Glu Gly Asn Asn 
195 200 205 

tgg agg age agt tea egg aag agg aaa gac gag gaa gga gaa gag caa 672 
Trp Arg Ser Ser Ser Arg Lys Arg Lys Asp Glu Glu Gly Glu Glu Gin 
210 215 220 

gga gat gat aag gac gaa gat gcg teg aat ttg aag aaa ccg cgt gtc 720 
Gly Asp Asp Lys Asp Glu Asp Ala Ser Asn Leu Lys Lys Pro Arg Val 
225 230 235 240 

gtc tgg tct gtt gaa ttg cat cag cag ttt gtt get get gtt aat cag 768 
Val Trp Ser Val Glu Leu His~ Gin Gin Phe Val Ala Ala Val Asn Gin 
. . 245" 250 255 

etc ggc gtt gag aag gcg gtt cct aaa aag ate tta gag ctg atg aat 816 
Leu Gly Val Glu Lys Ala Val Pro Lys Lys He Leu Glu Leu Met Asn 
260 265 270 

gtt cct ggt eta acc cga gaa aac gta gca agt cac etc cag aaa tac 864 
Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys Tyr 
275 280 285 

egg ata tat eta aga egg ctt gga ggg gta teg cag cac caa ggc aat 912 
Arg He Tyr Leu Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn 
290 295 300 

ctt aac aac teg ttt atg acg ggt cag gat gcg age ttc gga cct ctt 960 
Leu Asn Asn Ser Phe Met Thr Gly Gin Asp Ala Ser Phe Gly Pro Leu 
305 310 315 320 

teg aca ttg aat ggg ttt gat ctt caa gca eta gee gtc aca ggt cag 1008 
Ser Thr Leu Asn Gly Phe Asp Leu Gin Ala Leu Ala Val Thr Gly Gin 
325 330 335 

tta cct gca cag agt ctt gca cag ctt caa gec get ggt tta ggc egg 1056 
Leu Pro Ala Gin Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg 
340 345 350 

cct gcg atg gtc tct aag tea ggt ttg ccg gtt tec tec att gtg gat 1104 
Pro Ala Met Val Ser Lys Ser Gly Leu Pro Val Ser Ser He Val Asp 
355 360 365 

gag aga age ate ttc age ttt gac aac acg aaa aca aga ttt gga gaa 1152 
Glu Arg Ser lie Phe Ser Phe Asp Asn Thr Lys Thr Arg Phe Gly Glu 
370 375 380 

ggg ctt ggg cat cac ggg caa caa ccc caa cag caa cca cag atg aac 1200 
Gly Leu Gly His His Gly Gin Gin Pro Gin Gin Gin Pro Gin Met Asn 
385 390 395 400 

tta ctt cac ggt gtc ccc acg ggt tta caa cag cag ctt cct atg ggt 1248 
Leu Leu His Gly Val Pro Thr Gly Leu Gin Gin Gin Leu Pro Met Gly 
405 410 415 

aat cga atg agt att caa caa cag att get get* gtt cga get gga aat 1296 
Asn Arg Met Ser He Gin Gin Gin lie Ala Ala Val Arg Ala Gly Asn 
420 425 430 

agt gtt caa aac aac gga atg ctg atg cct eta gcg ggt cag cag tct 1344 
Ser Val Gin Asn Asn Gly Met Leu Met Pro Leu Ala Gly Gin Gin Ser 
435 440 445 

ttg cct egg gga cca ccg cct atg eta acc tct teg caa tea tec ate 1392 
Leu Pro Arg Gly Pro Pro Pro Met Leu Thr Ser Ser Gin Ser Ser He 
450 455 460 

agg cag ccg atg tta tea aac cgc att tec gag aga agt ggt ttc tct 1440 
Arg Gin Pro Met Leu Ser Asn Arg He Ser Glu Arg Ser Gly Phe Ser 
465 470 475 480 
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gga agg aac aat ate ccc gag age age aga gtg tta ccg aca agt tac 
Gly Arg Asn Asn lie Pro Glu Ser Ser Arg Val Leu Pro Thr Ser Tyr 
485 490 495 



1488 



act aat etc aca aca caa cac tea tea age teg atg cct tat aac aac 

Thr Asn Leu Thr Thr Gin His Ser Ser Ser Ser Met Pro Tyr Asn Asn 
500 505 510 

ttc caa cca gaa ctt ccc gtg aac agt ttc ccg ctg gca agt gca cca 

Phe Gin Pro Glu Leu Pro Val Asn Ser Phe Pro Leu Ala Ser Ala Pro 
515 520 525 

ggg ata tea gta ccg gtt egg aaa gee act tct tac cag gaa gag gtt 

Gly He Ser Val Pro Val Arg Lys Ala Thr Ser Tyr Gin Glu Glu Val 
530 535 540 

aac age tec gaa gcg ggt ttc. att acg ccg age tac gac atg ttc ace 

Asn Ser Ser Glu Ala. Gly Phe He Thr Pro Ser Tyr Asp Met Phe Thr 

545 550 555 560 

acc aga cag aat gat tgg gat ctg agg aat att gga ata gee ttt gac 

Thr Arg Gin Asn Asp Trp Asp Leu Arg Asn He Gly He Ala Phe Asp 

565 570 575 

tea cat cag gac tea gaa tec get gcg ttt tec get tea gaa gee tac 

Ser His Gin Asp Ser Glu Ser Ala Ala Phe Ser Ala Ser Glu Ala Tyr 
580 585 590 

tct tct teg tec atg tea aga cac aac acg aca gtt gca gee acc gag 

Ser Ser Ser Ser Met Ser Arg His Asn Thr Thr Val Ala Ala Thr Glu 
595 600 605 

cat ggc cga aac cac cag cag cca cca teg gga atg gta cag cac cat 

His Gly Arg Asn His Gin Glri Pro Pro Ser Gly Met Val Gin His His 
610 615 620 

cag gtt tat gca gac gga aac ggt ggt tea gtg agg gtg aaa tea gag 

Gin Val Tyr Ala Asp Gly Asn Gly Gly Ser Val Arg Val Lys Ser Glu 

625 630 635 640 

aga gtg get acg gat aca gca aca atg gcg ttt cac gag cag tat agt 

Arg Val Ala Thr Asp Thr Ala Thr Met Ala Phe His Glu Gin Tyr Ser 

645 650 655 

aat caa gaa gat ctt atg age gca ctt ctt aag cag gtt tga 

Asn Gin Glu Asp Leu Met Ser Ala Leu Leu Lya Gin Val 
660 665 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2010 



<210> 32 
<211> 669 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 32 

Met Met Asn Pro Ser His Gly Arg Gly Leu Gly Ser Ala Gly Gly Ser 
15 10 IS 

Ser Ser Gly Arg Asn Gin Gly Gly Gly Gly Glu Thr Val Val Glu Met 
20 25 30 

Phe Pro Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys 
35 40 45 

Leu Met He Leu Glu Arg Met Leu Arg Thr Cys Leu Tyr Glu Val Thr 
50 55 60 



Lys Cys Asn Arg Ala Glu Met Ala Leu Ser Leu Leu Arg Lys Asn Lys 
65 70 75 80 
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His Gly Phe Asp He Val He Ser Asp Val His Met Pro Asp Met Asp 
85 90 95 

Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val 

100 105 110 ■- 

He Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val 
115 120 125 

Thr His Gly Ala Val Asp Tyr Leu He Lys Pro Val Arg Met Glu Ala 
130 135 140 

Leu Lys Asn He Trp Gin His Val Val Arg Lys Arg Arg Ser Glu Trp 
145 \ ' 150 155 160 

Ser Val Pro Glu His Ser Gly Ser lie Glu Glu Thr Gly Glu Arg Gin 
165 170 175 

Gin Gin Gin His Arg Gly Gly Gly Gly Gly Ala Ala Val Ser Gly Gly 
180 185 190 

Glu Asp Ala Val Asp Asp Asn Ser Ser Ser Val Asn Glu Gly Asn Asn 
195 200 205 

Trp Arg Ser Ser Ser Arg Lys Arg Lys Asp Glu Glu Gly Glu Glu Gin 
210 215 220 

Gly Asp Asp Lys Asp Glu Asp Ala Ser Asn Leu Lys Lys Pro Arg Val 
» 225 230 235 240 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
245 250 255 

Leu Gly Val Glu Lys Ala Val Pro Lys Lys He Leu Glu Leu Met Asn 
260 265 270 

Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys Tyr 
275 280 285 

Arg He Tyr Leu Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn 
290 295 300 

Leu Asn Asn Ser Phe Met Thr Gly Gin Asp Ala Ser Phe Gly Pro Leu 
305 310 315 320 

Ser Thr Leu Asn Gly Phe Asp Leu Gin Ala Leu Ala Val Thr Gly Gin 
325 330 335 

Leu Pro Ala Gin Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg 
340 345 350 

Pro Ala Met Val Ser Lys Ser Gly Leu Pro Val Ser Ser lie Val Asp 
355 360 365 

Glu Arg Ser He Phe Ser Phe Asp Asn Thr Lys Thr Arg Phe Gly Glu 
370 375 380 
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Gly Leu Gly His His Gly Gin Gin Pro Gin Gin Gin Pro Gin Met Asn 

385 390 395 400 

Leu Leu His Gly Val Pro Thr Gly Leu Gin Gin Gin Leu Pro Met Gly 

405 410 415 

Asn Arg Met Ser He Gin Gin Gin He Ala Ala Val Arg Ala Gly Asn 

420 425 430 

Ser Val Gin Asn Asn Gly Met Leu Met Pro Leu Ala Gly Gin Gin Ser 

435 440 445 

Leu Pro Arg Gly Pro" Pro Pro Met Leu Thr Ser Ser Gin Ser Ser He 
450. 4S5 460 

Arg Gin Pro Met Leu Ser Asn Arg He Ser Glu Arg Ser Gly Phe Ser 

465 470 475 480 

Gly Arg Asn Asn He Pro Glu Ser Ser Arg Val Leu Pro Thr Ser Tyr 

485 490 495 



Thr Asn Leu Thr Thr Gin His Ser Ser Ser Ser Met Pro Tyr Asn Asn 
500 505 510 

Phe Gin Pro Glu Leu Pro Val Asn Ser Phe Pro Leu Ala Ser Ala Pro 
515 520 525 

Gly He Ser Val Pro Val Arg Lys Ala Thr Ser Tyr Gin Glu Glu Val 
530 535 540 

Asn Ser Ser Glu Ala Gly Phe He Thr Pro Ser Tyr Asp Met Phe Thr 
545 550 555 560 

Thr Arg Gin Asn Asp Trp Asp Leu Arg Asn He Gly He Ala Phe Asp 
565 570 575 

Ser His Gin Asp Ser Glu Ser Ala Ala Phe Ser Ala Ser Glu Ala Tyr 
580 585 590 

Ser Ser Ser Ser Met Ser Arg His Asn Thr Thr val Ala Ala Thr Glu 
595 600 605 

His Gly Arg Asn His Gin Gin Pro Pro Ser Gly Met Val Gin His Hie 
610 615 620 

Gin Val Tyr Ala Asp Gly Asn Gly Gly Ser Val Arg Val Lys Ser Glu 
625 630 635 640 

Arg Val Ala Thr Asp Thr Ala Thr Met Ala Phe His Glu Gin Tyr Ser 
645 650 655 

Asn Gin Glu Asp Leu Met Ser Ala Leu Leu Lys Gin Val 
660 665 



<210> 33 
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<211> 1239 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (6) . . (1091) 

<223> G993 

<:400> 33 

caaat atg gaa tac age tgt gta gac gac agt agt aca acg tea gaa tct 50 

Met Glu Tyr Ser Cys Val Asp Asp Ser Ser Thr Thr Ser Glu Ser 
15 10 15 

etc tec ate tct act act cca aag ccg aca acg acg acg gag aag aaa 98 
Leu Ser lie Ser Thr Thr Pro Lys Pro Thr Thr Thr Thr Glu Lys Lys 
20 25 30 

etc tct tct ccg ccg gcg acg teg atg cgt etc tac aga atg gga age 146 
Leu Ser Ser Pro Pro Ala Thr Ser Met Arg Leu Tyr Arg Met Gly Ser 
35 40 45 

ggc gga age age gtc gtt ttg gat tea gag aac ggc gtc gag ace gag 194 
Gly Gly Ser Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Thr Glu 
50 55 60 

tea cgt aag ctt cct teg teg aaa tat aaa ggc gtt gtg cct cag cct 242 
Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro 
65 70 75 

aac gga aga tgg gga get cag att tac gag aag cat cag cga gtt tgg 290 
Asn Gly Arg Trp Gly Ala Gin lie Tyr Glu Lys His Gin Arg Val Trp 
80 85 90 95 

etc ggt act ttc aac gag gaa gaa gaa get gcg tct tct tac gac ate 338 
Leu Gly Thr Phe Asn Glu Glu Glu Glu Ala Ala Ser Ser Tyr Asp lie 
100 105 110 

gee gtg agg aga ttc cgc ggc cgc gac gec gtc act aac ttc aaa tct 386 
Ala Val Arg Arg Phe Arg Gly Arg Asp Ala Val Thr Asn Phe Lys Ser 
115 120 125 

caa gtt gat gga aac gac gec gaa teg get ttt ctt gac get cat tct 434 
Gin Val Asp Gly Asn Asp Ala Glu Ser Ala Phe Leu Asp Ala His Ser 
130 135 140 

aaa get gag ate gtg gat atg ttg agg aaa cac act tac gee gat gag 482 
Lys Ala Glu lie Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu 
145 150 155 

ttt gag cag agt aga egg aag ttt gtt aac ggc gac gga aaa cgc tct 530 
Phe Glu Gin Ser Arg Arg Lys Phe Val Asn Gly Asp Gly Lys Arg Ser 
160 165 170 175 

ggg ttg gag acg gcg acg tac gga aac gac get gtt ttg aga gcg cgt 578 
Gly Leu Glu Thr Ala Thr Tyr Gly Asn Asp Ala Val Leu Arg Ala Arg 
180 185 190 

gag gtt ttg ttc gag aag act gtt acg ccg age gac gtc ggg aag ctg 626 
Glu Val Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu 
195 200 205 

aac cgt tta gtg ata ccg aaa caa cac gcg gag aag cat ttt ccg tta 674 
Asn Arg Leu Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu 
210 215 220 

ccg gcg atg acg acg gcg atg ggg atg aat ccg tct ccg acg aaa ggc 722 
Pro Ala Met Thr Thr Ala Met Gly Met Asn Pro Ser Pro Thr Lys Gly 
225 230 235 

gtt ttg att aac ttg gaa gat aga aca ggg aaa gtg tgg egg ttc cgt 770 
Val Leu lie Asn Leu Glu Asp Arg Thr Gly Lys Val Trp Arg Phe Arg 
240 245 250 255 
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tac agt tac tgg aac age agt caa agt tac gtg ttg acc aag ggc tgg 818 
Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp 
260 265 270 

age egg ttc gtc aaa gag aag aat ctt cga gee ggt gat gtg gtt tgt 866 
Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val Cys 
275 280 285 

ttc gag aga tea acc gga cca gac egg caa ttg tat ate cac tgg aaa 914 
Phe Glu Arg Ser Thr Gly Pro Asp Arg Gin Leu Tyr lie His Trp Lys 
290 295 300 

gtc egg tct agt ccg gtt cag act gtg gtt agg eta ttc gga gtc aac 962 
Val Arg Ser Ser Pro Val Gin Thr Val Val Arg Leu Phe Gly Val Asn 
305 310 315 

att ttc aat gtg agt aac gag aaa cca aac gac gtc gca gta gag tgt 1010 
lie Phe Asn Val Ser Asn Glu- Lys Pro Asn Asp Val Ala Val Glu Cys 
320 325 ' • 330 335 

gtt ggc aag aag aga tct egg gaa gat gat ttg ttt teg tta ggg tgt 1058 
Val Gly Lys Lys Arg Ser Arg Glu Asp Asp Leu Phe Ser Leu Gly Cys 
340 345 350 

tec aag aag cag gcg att ate aac ate ttg tga caaattcttt ttttttggtt 1111 
Ser Lys Lys Gin Ala lie lie Asn lie Leu 
355 360 

tttttcttca atttgtttct cctttttcaa tattttgtat tgaaatgaca agttgtaaat 1171 

taggacaaga caagaaaaaa tgacaactag acaaaatagt ttttgtttaa aaaaaaaaaa 1231 

aaaaaaaa 123 9 

<210> 34 

<211> 361 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 34 

Met Glu Tyr Ser Cys Val Asp Asp Ser Ser Thr Thr Ser Glu Ser Leu 
1 5 10 !5 

Ser He Ser Thr Thr Pro Lys Pro Thr Thr Thr Thr Glu Lys Lys Leu 
20 25 30 

Ser Ser Pro Pro Ala Thr Ser Met Arg Leu Tyr Arg Met Gly Ser Gly 
35 - 40 45 

Gly Ser Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Thr Glu Ser 
50 55 60 

Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro Asn 
65 70 75 80 

Gly Arg Trp Gly Ala Gin He Tyr Glu Lys His Gin Arg Val Trp Leu 
85 90 95 

Gly Thr Phe Asn Glu Glu Glu Glu Ala Ala Ser Ser Tyr Asp He Ala 
100 105 110 

Val Arg Arg Phe Arg Gly Arg Asp Ala Val Thr Asn Phe Lys Ser Gin 
115 120 125 



Val Asp Gly Asn Asp Ala Glu Ser Ala 



Phe Leu Asp Ala His Ser Lys 
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130 135 140 



\ 



Ala Glu lie Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Phe 
145 150 155 160 



Glu Gin Ser Arg Arg Lys Phe Val Asn Gly Asp Gly Lys Arg Ser Gly 
165 170 175 



Leu Glu Thr Ala Thr Tyr Gly Asn Asp Ala Val Leu Arg Ala Arg Glu 
180 185 190 



Val Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn 
195 200 205 



Arg Leu Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu Pro 
210 215 220 



Ala Met Thr Thr Ala Met Gly Met Asn Pro Ser Pro Thr Lys Gly Val 
225 230 235 240 



Leu lie Asn Leu Glu Asp Arg Thr Gly Lys Val Trp Arg Phe Arg Tyr 
245 250 255 



Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp Ser 
260 265 270 



Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val Cys Phe 
275 280 285 



Glu Arg Ser. Thr Gly Pro Asp Arg Gin Leu Tyr lie His Trp Lys Val 
290 295 300 



Arg Ser Ser Pro Val Gin Thr Val Val Arg Leu Phe Gly Val Asn He 
305 310 315 320 



Phe Asn Val Ser Asn Glu Lys Pro Asn Asp Val Ala val Glu Cys Val 
325 330 335 



Gly Lys Lys Arg Ser Arg Glu Asp Asp Leu Phe Ser Leu Gly Cys Ser 
340 345 350 



Lys Lys Gin Ala He He Asn He Leu 
355 360 



<210> 


35 


<211> 


1281 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


c222> 


(64) . . (1098) 


c223> 


G867 


<400> 


35 



cacaacacaa acacatttct gttttctcca ttgtttcaaa ccataaaaaa aaacacagat 

taa atg gaa teg agt age gtt gat gag agt act aca agt aca ggt tec 
Met Glu Ser Ser Ser Val Asp Glu Ser Thr Thr Ser Thr Gly Ser 
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10 



15 



ate tgt gaa acc ccg gcg ata act ccg gcg aaa aag teg teg gta ggt 
He Cys Glu Thr Pro Ala He Thr Pro Ala Lys Lys Ser Ser Val Gly 
20 25 30 



aaa cac cag cgc gtg tgg. etc ggg aca ttc aac gaa gaa gac gaa gec 
Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala 
80 85 90 95 

get cgt gee tac gac gtc gcg gtt cac agg ttc cgt cgc cgt gac gee 
Ala Arg Ala Tyr Asp Val Ala Val His Arg Phe Arg Arg Arg Asp Ala 

100 105 110 

gtc aca aat ttc aaa gac gtg aag atg gac gaa gac gag gtc gat ttc 
Val Thr Asn Phe Lys Asp Val Lys Met Asp Glu Asp Glu Val Asp Phe 
115 120 125 

ttg aat tct cat teg aaa tct gag ate gtt gat atg ttg agg aaa cat 
Leu Asn Ser His Ser Lys Ser Glu He Val Asp Met Leu Arg Lys His 
130 135 140 

act tat aac gaa gag tta gag cag agt aaa egg cgt cgt aat ggt aac 

Thr Tyr Asn Glu Glu Leu Glu Gin Ser Lys Arg Arg Arg Asn Gly Asn 

145 150 155 

i 

gga aac atg act agg acg ttg tta acg teg ggg ttg agt aat gat ggt 
Gly Asn Met Thr Arg Thr Leu Leu Thr Ser Gly Leu Ser Asn Asp Gly 
160 165 170 175 

gtt tct acg acg ggg ttt aga teg gcg gag gca ctg ttt gag aaa gcg 
Val Ser Thr Thr Gly Phe Arg Ser Ala Glu Ala Leu Phe Glu Lys Ala 
180 185 190 



gta acg 
Val Thr 



cat cac 
His His 



aaa gga 
Lys Gly 
225 

ttc cgt 
Phe Arg 
240 

ggt tgg 
Gly Trp 



cca age 
Pro Ser 
195 

gca gag 
Ala Glu 
210 

gtg ttg 
Val Leu 



gtt agt 
Val Ser 



tgg aag 
Trp Lys 



tac teg 
Tyr Ser 



age agg 
Ser Arg 



ttc agt 
Phe Ser 
275 

teg aga 
Ser Arg 
290 



gac gtt 
Asp Val 



aaa cat 
Lys His 



ttg aac 
Leu Asn 



tat tgg 
Tyr Trp 
245 

ttc gtt 
Phe Val 
260 

aga tct 
Arg Ser 



ggg aag 
Gly Lys 



ttt ccg 
Phe Pro 
215 

ttt gag 
Phe Glu 
230 

aac agt 
Asn Ser 



aag gag 
Lys Glu 



tec ggg 
Ser Gly 



aac ggt 
Asn Gly 



tea gat 
Ser Asp 
295 



eta aac cgt ttg 
Leu Asn Arg Leu 
200 

tta ccg tea agt 
Leu Pro Ser Ser 



gac gtt aac ggg 
Asp Val Asn Gly 
235 

agt cag agt tat 
Ser Gin Ser Tyr 
250 

aag aat eta cgt 
Lys Asn Leu Arg 
265 

cag gat caa cag 
Gin Asp Gin Gin 
280 

tta gat gcg ggt 
Leu Asp Ala Gly 



156 



aac tta tac agg atg gga age gga tea age gtt gtg tta gat tea gag 204 
Asn Leu Tyr Arg Met Gly Ser Gly Ser Ser Val Val Leu Asp Ser Glu 
35 40 45 

aac ggc gta gaa get gaa tct agg aag ctt ccg teg tea aaa tac aaa 252 
Asn Gly Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys 
50 55 60 

ggt gtg gtg cca caa cca aac gga aga tgg gga get cag att tac gag 3 00 

Gly Val val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He Tyr Glu 
65 70 75 



348 



396 



444 



492 



540 



588 



636 



684 



732 



ttg ttc gga gtt aac att tea ccg 



gag agt tea aga 
Page 59 



gtt ata ccg aaa 
Val He Pro Lys 
205 

aac gtt tec gtg 
Asn Val Ser Val 
220 



aaa gtg tgg agg 780 
Lys Val Trp Arg 



gtt ttg act aaa 828 
val Leu Thr Lys 
255 

get ggt gac gtg 876 
Ala Gly Asp Val 
270 

ttg tac att ggg 924 
Leu Tyr He Gly 
285 

egg gtt ttg aga 972 

Arg Val Leu Arg 

300 

aac gac gtc gta 1020 
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Leu Phe Gly Val Asn He Ser Pro Glu Ser Ser Arg Asn Asp Val Val 
305 310 315 

gga aac aaa aga gtg aac gat act gag atg tta teg ttg gtg tgt age 1068 
Gly Asn Lys Arg Val Asn Asp Thr Glu Met Leu Ser Leu Val Cys Ser 
320 325 330 335 

aag aag caa cgc ate ttt cac gee teg taa caactcttct tctttttttt 1118 
Lys Lys Gin Arg He Phe His Ala Ser 

340 

tcttttgttg ttttaataat ttttaaaaac tccattttcg ttttctttat ttgcatcggt 1178 

ttctttcttc ttgtttacca aaggttcatg agttgttttt gttgtattga tgaactgtaa 1238 

attttattta taggataaat tttaaaaaaa aaaaaaaaaa aaa 1281 

<210> 3 6 

<211> 344 

<212> PRT 

<213> Arabidopsis thaliana 

<4O0> 36 

Met Glu Ser Ser Ser Val Asp Glu Ser Thr Thr Ser Thr Gly Ser He 
1.5 10 15 

Cys Glu Thr Pro Ala He Thr Pro Ala Lys Lys Ser Ser Val Gly Asn 
20 25 30 

Leu Tyr Arg Met Gly Ser Gly Ser Ser Val Val Leu Asp Ser Glu Asn 
35 40 45 

Gly Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly 
50 55 60 

Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He Tyr Glu Lys 
65 70 75 80 

His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala 

85 90 95 

Arg Ala Tyr Asp Val Ala Val His Arg Phe Arg Arg Arg Asp Ala Val 
100 105 110 

Thr Asn Phe Lys Asp Val Lys Met Asp Glu Asp Glu Val Asp Phe Leu 
115 120 125 

Asn Ser His Ser Lys Ser Glu He Val Asp Met Leu Arg Lys His Thr 
130 135 140 

Tyr Asn Glu Glu Leu Glu Gin Ser Lys Arg Arg Arg Asn Gly Asn Gly 
145 150 155 160 

Asn Met Thr Arg Thr Leu Leu Thr Ser Gly Leu Ser Asn Asp Gly Val 

165 170 175 

Ser Thr Thr Gly Phe Arg Ser Ala Glu Ala Leu Phe Glu Lys Ala Val 
180 185 190 

Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val lie Pro Lys His 
195 200 205 
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His Ala Glu Lys His Phe Pro Leu Pro Ser Ser Asn Val Ser Val Lys 
210 215 220 



Gly Val Leu Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg Phe 
225 230 235 240 

Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly 
245 250 255 

Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val 
260 265 270 



Ser Phe Ser Arg Ser Asn Gly Gin Asp Gin Gin Leu Tyr lie Gly Trp 
275 280 285 



Lys Ser Arg Ser Gly Ser Asp Leu Asp Ala Gly Arg Val Leu Arg Leu 
290 295 300 

Phe Gly Val Asn lie Ser Pro Glu Ser Ser Arg Asn Asp Val Val Gly 

305 310 315 320 

Asn Lys Arg Val Asn Asp Thr Glu Met Leu Ser Leu Val Cys Ser Lys 
325 330 335 



Lys Gin Arg lie Phe His Ala Ser 
340 



<210> 


37 


<211> 


1155 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(76) . . (1077) 


<223> 


G1930 


<400> 


37 



attcacatta ctaatctctc aagatttcac aattttcttg tgattttctc tcagtttctt 60 



atttcgtttc ataac atg gat gcc atg agt age gta gac gag age tct aca 

Met Asp Ala Met Ser Ser Val Asp Glu Ser Ser Thr 
15 10 



eta tat aga atg gga age gga aca age gtg gta ctt gat tea gag aac 
Leu Tyr Arg Met Gly Ser Gly Thr Ser Val Val Leu Asp Ser Glu Asn 
30 35 40 



aga ttc aaa ggt gtt gtt cct caa cca aat gga aga tgg gga get cag 
Arg Phe Lys Gly Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin 
65 70 75 



111 



act aca gat tec att ccg gcg aga aag tea teg tct ccg gcg agt tta 159 
Thr Thr Asp Ser lie Pro Ala Arg Lys Ser Ser Ser Pro Ala Ser Leu 
15 20 25 



207 



ggt gtc gaa gtc gaa gtc gaa gcc gaa tea aga aag ctt cct tct tea 255 
Gly Val Glu Val Glu Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser 
45 50 55 60 



303 



att tac gag aaa cat caa cgc gtg tgg ctt ggt act ttc aac gag gaa 3 51 

He Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu 
80 85 90 
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gac gaa gca get cgt get tac gac gtc gcg get cac cgt ttc cgt ggc 399 
Asp Glu Ala Ala Arg Ala Tyr Asp Val Ala Ala His Arg Phe Arg Gly 
95 100 105 

cgc gat gec gtt act aat ttc aaa gac acg acg ttc gaa gaa gag gtt 447 
Arg Asp Ala Val Thr Asn Phe Lys Asp Thr Thr Phe Glu Glu Glu Val *- 
110 115 120 

gag ttc tta aac gcg cat teg aaa tea gag ate gta gat atg ttg aga 495 
Glu Phe Leu Asn Ala His Ser Lye Ser Glu lie Val Asp Met Leu Arg 
125 130 135 140 

aaa cac act tac aaa gaa gag tta gac caa agg aaa cgt aac cgt gac 543 
Lys His Thr Tyr Lys Glu Glu Leu Asp Gin Arg Lys Arg Asn Arg Asp 
145 150 155 

ggt aac gga aaa gag acg acg gcg ttt get ttg get teg atg gtg gtt 591 
Gly Asn Gly Lys Glu Thr Thr Ala Phe Ala Leu Ala Ser Met Val Val 
160 165 170 

atg acg ggg ttt aaa acg gcg gag tta ctg ttt gag aaa acg gta acg 63 9 

Met Thr Gly Phe Lys Thr Ala Glu Leu Leu Phe Glu Lys Thr Val Thr 
175 180 185 

cca agt gac gtc ggg aaa eta aac cgt tta gtt ata cca aaa cac caa 687 
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val lie Pro Lys His Gin 
190 195 200 

gcg gag aaa cat ttt ccg tta ccg tta ggt aat aat aac gtc tec gtt 735 
Ala Glu Lys His Phe Pro Leu Pro Leu Gly Asn Asn Asn Val Ser Val 
205 210 215 220 

aaa ggt atg ctg ttg aat ttc gaa gac gtt aac ggg aaa gtg tgg agg 783 
Lys Gly Met Leu Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg 
225 230 235 

ttc cgt tac tct tat tgg aat agt agt caa agt tat gtg ttg acc aaa 831 
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys 
240 245 250 

99t tgg agt aga ttc gtt aaa gag aag aga ctt tgt get ggt gat ttg 879 
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Cys Ala Gly Asp Leu 
255 260 . 265 

ate agt ttt aaa aga tec aac gat caa gat caa aaa ttc ttt ate ggg 927 
lie Ser Phe Lys Arg Ser Asn Asp Gin Asp Gin Lys Phe Phe lie Gly 
270 275 280 

tgg aaa teg aaa tec ggg ttg gat eta gag acg ggt egg gtt atg aga 975 
Trp Lys Ser Lys Ser Gly Leu Asp Leu Glu Thr Gly Arg Val Met Arg 
285 290 295 300 

ttg ttt ggg gtt gat att tct tta aac gec gtc gtt gta gtg aag gaa 1023 
Leu Phe Gly Val Asp He Ser Leu Asn Ala Val Val Val Val Lys Glu 
305 310 315 

aca acg gag gtg tta atg teg teg tta agg tgt aag aag caa cga gtt 1071 
Thr Thr Glu Val Leu Met Ser Ser Leu Arg Cys Lys Lys Gin Arg Val 
320 325 330 

ttg taa taacaattta acaacttggg aaagaaaaaa aagctttttg attttaattt 1127 
Leu 

ctcttcaacg ttaatcttgc tgagatta 1155 

<210> 38 

<211> 333 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 38 
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Met Asp Ala Met Ser Ser Val Asp Glu Ser Ser Thr Thr Thr Asp Ser 
15 10 15 

lie Pro Ala Arg Lys Ser Ser Ser Pro Ala Ser Leu Leu Tyr Arg Met 
20 25 30 

Gly Ser Gly Thr Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Val 
35 40 45 

Glu Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Arg Phe Lys Gly 
50 55 60 

Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin lie Tyr Glu Lys 
65 70 75 80 

His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala 
85 90 95 

Arg Ala Tyr Asp Val Ala Ala His Arg Phe Arg Gly Arg Asp Ala Val 
100 105 110 

Thr Asn Phe Lys Asp Thr Thr Phe Glu Glu Glu Val Glu Phe Leu Asn 
115 120 125 

Ala His Ser Lys Ser Glu lie Val Asp Met Leu Arg Lys His Thr Tyr 
130 135 140 

Lys Glu Glu Leu Asp Gin Arg Lys Arg Asn Arg Asp Gly Asn Gly Lys 
145 150 155 160 

Glu Thr Thr Ala Phe Ala Leu Ala Ser Met Val Val Met Thr Gly Phe 
165 170 175 

Lys Thr Ala Glu Leu Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val 
180 185 190 

Gly Lys Leu Asn Arg Leu Val He Pro Lys His Gin Ala Glu Lys His 
195 200 205 

Phe Pro Leu Pro Leu Gly Asn Asn Asn Val Ser Val Lys Gly Met Leu 
210 215 220 

Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg Phe Arg Tyr Ser 
225 230 235 240 

Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg 
245 250 255 

Phe Val Lys Glu Lys Arg Leu Cys Ala Gly Asp Leu He Ser Phe Lys 
260 265 270 

Arg Ser Asn Asp Gin Asp Gin Lys Phe Phe He Gly Trp Lys Ser Lys 
275 280 285 

Ser Gly Leu Asp Leu Glu Thr Gly Arg Val Met Arg Leu Phe Gly Val 
290 295 300 
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Asp lie Ser Leu Asn Ala Val Val Val Val Lys Glu Thr Thr Glu Val 
305 310 315 320 

Leu Met Ser Ser Leu Arg Cys Lys Lys Gin Arg Val Leu 
325 330 



<210> 


39 


<211> 


1035 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (1035) 


<223> 


G1594 


<400> 


39 



atg tac aat ttc cat teg gec ggt gat tat tea gat aag teg gtt ctg 48 
Met Tyr Asn Phe His Ser Ala Gly Asp Tyr Ser Asp Lys Ser Val Leu 
15 10 15 

atg atg tea ccg gag agt etc atg ttt cct tec gat tac caa get ttg 96 
Met Met Ser Pro Glu Ser Leu Met Phe Pro Ser Asp Tyr Gin Ala Leu 
20 25 30 

eta tgt tec tec gee ggt gaa aat cgt gtc tct gat gtt ttc gga tec 144 
Leu Cys Ser Ser Ala Gly Glu Asn Arg Val Ser Asp Val Phe Gly Ser 
35 40 45 

gac gag eta etc tea gta gee gtc tec get ttg teg teg gag gcg get 192 
Asp Glu Leu Leu Ser Val Ala Val Ser Ala Leu Ser Ser Glu Ala Ala 
50 55 60 

teg ate get ccg gag ate cga aga aat gat gat aac gtt tct eta act 240 
Ser lie Ala Pro Glu lie Arg Arg Asn Asp Asp Asn Val Ser Leu Thr 
65 70 75 80 

gtc ate aaa get aaa ate get tgt cat cct teg tat cct cgc tta ctt 288 
Val lie Lys Ala Lys lie Ala Cys His Pro Ser Tyr Pro Arg Leu Leu 
85 90 95 

caa get tac ate gat tgc caa aag gtc gga gca cca ccg gag ata gcg 336 
Gin Ala Tyr lie Asp Cys Gin Lys Val Gly Ala Pro Pro Glu lie Ala 
100 105 110 

tgt tta eta gag gag att caa egg gag agt gat gtt tat aag caa gag 3 84 

Cys Leu Leu Glu Glu lie Gin Arg Glu Ser Asp Val Tyr Lys Gin Glu 
115 120 125 

gtt gtt cct tct tct tgc ttt gga get gat cct gag ctt gat gaa ttt 432 
Val Val Pro Ser Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe 
130 135 140 

atg gaa acg tac tgc gat ata tta gtg aaa tac aaa teg gat eta gca 480 
Met Glu Thr Tyr Cys Asp lie Leu Val Lys Tyr Lys Ser Asp Leu Ala 
145 150 155 160 

aga ccg ttt gac gag gca acg tgt ttc ttg aac aag att gag atg cag 528 
Arg Pro Phe Asp Glu Ala Thr Cys Phe Leu Asn Lys lie Glu Met Gin 
165 170 175 

eta egg aac eta tgt act ggt gtc gag tct gee agg gga gtt tct ggg 576 
Leu Arg Asn Leu Cys Thr Gly Val Glu Ser Ala Arg Gly Val Ser Gly 
180 185 190 

ggg atg tct cct cat ggg gac aag act att agt cct etc ctg aca aat 624 
Gly Met Ser Pro His Gly Asp Lys Thr lie Ser Pro Leu Leu Thr Asn 
195 200 205 

gac aat gga gag gat ggt gta ata tea tct gac gag gaa ctg agt gga 672 
Asp Asn Gly Glu Asp Gly Val lie Ser Ser Asp Glu Glu Leu Ser Gly 
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210 215 220 

ggt gat cat gag gta gca gag gat ggg aga caa aga tgt gaa gac egg 720 
Gly Asp His Glu Val Ala Glu Asp Gly Arg Gin Arg Cys Glu Asp Arg 
225 230 235 240 

gac etc aaa gat agg ttg eta cgc aaa ttt gga age cgt att agt act v 768 
Asp Leu Lys Asp Arg Leu Leu Arg Lys Phe Gly Ser Arg lie Ser Thr 
245 250 255 

tta aag ctt gag ttc tea aag aag aag aag aaa gga aag tta cca aga 816 
Leu Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg 
260 265 270 

gaa gca aga caa get ctt ctt gat tgg tgg aat etc cat tat aag tgg 864 
Glu Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Leu His Tyr Lys Trp 
275 280 285 

cct tac cct act gaa, gga gat aag ata gca tta get gat gca acg ggg 912 
Pro Tyr Pro Thr Glu Gly Asp Lys lie Ala Leu Ala Asp Ala Thr Gly 
290 295 300 

tta gac caa aaa caa ate aac aat tgg ttt ata aac caa agg aaa cgt 960 
Leu Asp Gin Lys Gin He Asn Asn Trp Phe He Asn Gin Arg Lys Arg 
305 310 315 320 

cat tgg aag cca tea gag aat atg cct ttc get atg atg gat gat tct 1008 
His Trp Lys Pro Ser Glu Asn Met Pro Phe Ala Met Met Asp Asp Ser 
325 330 335 



agt gga tea ttc ttt ace gag gaa tga 
Ser Gly Ser Phe Phe Thr Glu Glu 
340 



<210> 40 
<211> 344 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 40 

Met Tyr Asn Phe His Ser Ala Gly Asp Tyr Ser Asp Lys Ser Val Leu 
15 10 15 

Met Met Ser Pro Glu Ser Leu Met Phe Pro Ser Asp Tyr Gin Ala Leu 
20 25 30 

Leu Cys Ser Ser Ala Gly Glu Asn Arg Val Ser Asp Val Phe Gly Ser 
35 . 40 45 

Asp Glu Leu Leu Ser Val Ala Val Ser Ala Leu Ser Ser Glu Ala Ala 
50 55 60 

Ser He Ala Pro Glu He Arg Arg Asn Asp Asp Asn Val Ser Leu Thr 
65 70 75 80 

Val He Lys Ala Lys He Ala Cys His Pro Ser Tyr Pro Arg Leu Leu 
85 90 95 

Gin Ala Tyr He Asp Cys Gin Lys Val Gly Ala Pro Pro Glu He Ala 
100 105 110 

Cys Leu Leu Glu Glu He Gin Arg Glu Ser Asp Val Tyr Lys Gin Glu 
115 120 125 



1035 



Val Val Pro Ser Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe 
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130 135 140 



Met Glu Thr Tyr Cys Asp lie Leu Val Lys Tyr Lys Ser Asp Leu Ala 
145 150 155 160 



Arg Pro Phe Asp Glu Ala Thr Cys Phe Leu Asn Lys He Glu Met Gin 
165 170 175 



Leu Arg Asn Leu Cys Thr Gly Val Glu Ser Ala Arg Gly Val Ser Gly 
180 185 190 



Gly Met Ser Pro His Gly Asp Lys Thr He Ser Pro Leu Leu Thr Asn 
195 200 205 



Asp Asn Gly Glu Asp Gly Val He Ser Ser Asp Glu Glu Leu Ser Gly 
210 215 220 



Gly Asp His Glu Val Ala Glu Asp Gly Arg Gin Arg Cys Glu Asp Arg 
225 230 235 240 



Asp Leu Lys Asp Arg Leu Leu Arg Lys Phe Gly Ser Arg He Ser Thr 
245 250 255 



Leu Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg 
260 265 270 



Glu Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Leu His Tyr Lys Trp 
275 280 285 



Pro Tyr Pro Thr Glu Gly Asp Lys He Ala Leu Ala Asp Ala Thr Gly 
290 295 300 



Leu Asp Gin Lys Gin He Asn Asn Trp Phe He Asn Gin Arg Lys Arg 
305 310 315 320 



His Trp Lys Pro Ser Glu Asn Met Pro Phe Ala Met Met Asp Asp Ser 
325 330 335 



Ser Gly Ser Phe Phe Thr Glu Glu 
340 



<210> 


41 


<211> 


2559 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2259) 


<223> 


G3 91 


<400> 


41 



atg atg atg gtc cat teg atg age aga gat atg atg aac aga gag teg 
Met Met Met Val His Ser Met Ser Arg Asp Met Met Asn Arg Glu Ser 
15 10 15 

ccg gat aaa ggg tta gat tec ggc aag tat gtg agg tac acg ccg gag 
Pro Asp Lys Gly Leu Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu 
20 25 30 
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caa gtg gaa get etc gag aga gtt tac act gag tgt cct aag cca agt 144 
Gin Val Glu Ala Leu Glu Arg Val Tyr Thr Glu Cys Pro Lys Pro Ser 
35 40 45 

tct eta aga aga caa caa etc ata cgt gaa tgt ccg att etc tct aac 192 
Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Pro He Leu Ser Asn 
50 55 60 

ate gag cct aag cag ate aaa gtt tgg ttt cag aac cgc aga tgt cgt 240 
He Glu Pro Lys Gin He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 75 80 

gag aag cag agg aaa gaa get get cgt ctt caa aca gtg aac aga aaa 288 
Glu Lys Gin Arg Lys Glu Ala Ala Arg Leu Gin Thr Val Aen Arg Lys 
85 90 95 

etc aat gec atg aac aaa etc ttg atg gaa gag aat gat cgt ttg cag 336 
Leu Asn Ala Met Asn Lys Leu* Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

aag caa gtt tct aac ttg gtc tat gag aat ggc cac atg aaa cat caa 384 
Lys Gin Val Ser Asn Leu Val Tyr Glu Asn Gly His Met Lys His Gin 
115 • 120 125 

ctt cac act get tct ggg acg acc aca gac aac age tgt gag tct gtg 432 
Leu His Thr Ala Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val 
130 135 140 

gtc gtg agt ggt cag caa cat caa cag caa aac cca aat cct cag cat 480 
Val Val Ser Gly Gin Gin His Gin Gin Gin Asn Pro Asn Pro Gin His 
145 150 155 160 

cag caa cgt gat get aac aac cca gca gga etc ctt tct ata gca gag 528 
Gin Gin Arg Asp Ala Asn Asn Pro Ala Gly Leu Leu Ser lie Ala Glu 
165 170 175 

gag gec eta gca gag ttc ctt tec aag get aca gga act get gtt gac 576 
Glu Ala Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr Ala Val Asp 
180 185 190 

tgg gtt cag atg att ggg atg aag cct ggt ccg gat tct att ggc ata 624 
Trp Val Gin Met He Gly Met Lys Pro Gly Pro Asp Ser He Gly He 
195 200 205 

gtc get att teg cgc aac tgc age gga att gca gca cgt gee tgc ggc 672 
val Ala He Ser Arg Asn Cys Ser Gly He Ala Ala Arg Ala Cys Gly 
210 215 220 

etc gtg agt tta gaa ccc atg aag gtt get gaa att etc aaa gat cgt 720 
Leu Val Ser Leu Glu Pro Met Lys Val Ala Glu He Leu Lys Asp Arg 
225 230 235 240 

cca tct tgg etc cga gat tgt cga agt gtg gat act ctg agt gtg ata 768 
Pro Ser Trp Leu Arg Asp Cys Arg Ser Val Asp Thr Leu Ser Val He 
245 250 255 

cct get gga aac ggt ggg acg ate gag ctt att tac acg cag atg tat 816 
Pro Ala Gly Asn Gly Gly Thr He Glu Leu lie Tyr Thr Gin Met Tyr 
260 265 270 

get cct acg act tta gca gca get cgt gac ttt tgg acg ctg aga tat 864 
Ala Pro Thr Thr Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr 
275 280 285 

age aca tgt ttg gaa gat gga age tat gtg gtt tgt gaa agg teg ctt 912 
Ser Thr Cys Leu Glu Asp Gly Ser Tyr Val val Cys Glu Arg Ser Leu 
290 295 300 

act tct gca act ggt ggc ccc act ggg cca cct tct tea aac ttt gtg 960 
Thr Ser Ala Thr Gly Gly Pro Thr Gly Pro Pro Ser Ser Asn Phe Val 
305 310 315 320 

aga get gaa atg aaa cca age ggg ttt etc ate cgt cct tgc gat ggt 1008 
Arg Ala Glu Met Lys Pro Ser Gly Phe Leu He Arg Pro Cys Asp Gly 
325 330 335 
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ggt ggt tec att etc cac att gtt gat cat gtt gat ctg gat gee tgg 1056 
Gly Gly Ser lie Leu His lie Val Asp His Val Asp Leu Asp Ala Trp 
340 345 350 

agt gtc cct gaa gtc atg agg cct etc tat gaa tea teg aag att ctt 1104 
Ser Val Pro Glu Val Met Arg Pro Leu Tyr Glu Ser Ser Lys lie Leu 
355 360 365 

get cag aaa atg act gtt get get ttg aga cat gta aga caa att gca 1152 
Ala Gin Lys Met Thr Val Ala Ala Leu Arg His Val Arg Gin lie Ala 
370 375 380 

caa gaa aca agt gga gaa gtt cag tat ggt gga ggg cgc caa cct gcg 1200 
Gin Glu Thr Ser Gly Glu Val Gin Tyr Gly Gly Gly Arg Gin Pro Ala 
385 390 395 400 

gtt tta aga ace ttc agt caa- aga etc tgt egg ggt ttc aat gat get 1248 
Val Leu Arg Thr Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala 
405 410 415 

gtt aat ggt ttt gtg gat gat gga tgg tea cca atg ggt age gat ggt 1296 
Val Asn Gly Phe Val Asp Asp Gly Trp Ser Pro Met Gly Ser Asp Gly 
420 425 430 

gca gag gat gtt act gta atg ata aac ttg tec cct ggg aag ttt ggt 1344 
Ala Glu Asp Val Thr Val Met lie Asn Leu Ser Pro Gly Lys Phe Gly 
435 440 445 

ggg tct cag tac ggt aat tea ttc ctt cca age ttt ggt agt ggc gtg 1392 
Gly Ser Gin Tyr Gly Asn Ser Phe Leu Pro Ser Phe Gly Ser Gly Val 
450 455 460 

ctt tgt gec aag gca tct atg ttg ctt cag aac gtt cca ccc get gtg 1440 
Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 
465 470 475 480 

ctg gtt cga ttc ctt aga gaa cac cga tct gaa tgg get gat tat ggc 1488 
Leu Val Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Tyr Gly 
485 490 495 

gtg gat get tat get get gca teg etc aga gca agt cct ttt get gtt 1536 
Val Asp Ala Tyr Ala Ala Ala Ser Leu Arg Ala Ser Pro Phe Ala Val 
500 505 510 * 

cct tgt get aga get ggg ggg ttc cca agt aac caa gtc att ctt cct 1584 
Pro Cys Ala Arg Ala Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro 
515 520 525 

ctt gcg cag aca gtt gaa cat gaa gag tea ctt gag gtg gtt aga ctt 1632 
Leu Ala Gin Thr Val Glu His Glu Glu Ser Leu Glu Val Val Arg Leu 
530 535 540 

gaa ggt cac get tac tea ccc gaa gac atg ggt tta get egg gat atg 1680 
Glu Gly His Ala Tyr Ser Pro Glu Asp Met Gly Leu Ala Arg Asp Met 
545 550 555 560 

tat ttg eta cag ctt tgt age ggt gtt gat gaa aat gtg gtt gga ggt 1728 
Tyr Leu Leu Gin Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly 
565 570 575 

tgt gca cag ctt gta ttt gec cct ate gat gaa tea ttt get gat gat 1776 
Cys Ala Gin Leu Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp 
580 585 590 

gca cct ttg ctt cct tec ggt ttc cgc ate ata cct ctt gaa cag aaa 1824 
Ala Pro Leu Leu Pro Ser Gly Phe Arg lie lie Pro Leu Glu Gin Lys 
595 600 605 

tct act ccg aac ggt gca tct gca aac cgt acc ctg gat tta gec tea 1872 
Ser Thr Pro Asn Gly Ala Ser Ala Asn Arg Thr Leu ABp Leu Ala Ser 
610 615 620 

get tta gaa gga tec aca cgt caa get ggt gaa gee gac cca aat ggc 1920 
Ala Leu Glu Gly Ser Thr Arg Gin Ala Gly Glu Ala Asp Pro Asn Gly 
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625 630 635 640 

tgt aac ttt agg teg gta eta acc ata gca ttc cag ttc aca ttt gat 1968 
Cys Asn Phe Arg Ser Val Leu Thr lie Ala Phe Gin Phe Thr Phe Asp 
645 650 655 

aac cat tea aga gac agt gtt get tea atg gca cgt cag tac gtg cga 2016 
Asn His Ser Arg Asp Ser Val Ala Ser Met Ala Arg Gin Tyr Val Arg 
660 665 670 

age ata gta gga teg att cag agg gtt get eta gee att get cct cgt 2064 
Ser He Val Gly Ser He Gin Arg Val Ala Leu Ala He Ala Pro Arg 
675 680 685 

cct ggc tec aat ate agt cca ata tct gtt ccc act tec cct gaa get 2112 
Pro Gly Ser Asn He Ser Pro He Ser Val Pro Thr Ser Pro Glu Ala 
690 695 700 

etc act ctg gtc cgt-- tgg atc tec egg agt tac age ctt cac act ggt 2160 
Leu Thr Leu Val Arg Trp He Ser Arg Ser Tyr Ser Leu His Thr Gly 
705 710 715 720 

gca gat etc ttt gga tct gat tct caa acc agt ggt gac acg ttg ctg 2208 
Ala Asp Leu Phe Gly Ser Asp Ser Gin Thr Ser Gly Asp Thr Leu Leu 
725 730 735 

cat caa etc tgg aat cac tct gat gca ate ttg tgc tgc tec etc aaa 2256 
His Gin Leu Trp Asn His Ser Asp Ala He Leu Cys Cys Ser Leu Lys 
740 745 750 

aca aacgcttcac eggttttcac attegcaaac caaaceggtt tagacatget 2309 
Thr 



ggaaacgact 


cttgtagccc 


ttcaagacat 


aatgetagae 


aagacccttg 


acgaacctgg 


2369 


tegtaaaget 


etttgetctg 


agttccccaa 


gatcatgeaa 


cagggctatg 


ctcatctgcc 


2429 


ggcaggagta 


tgtgcgtcaa gcatgggaag gatggtatct 


tacgagcagg 


caacggtgtg 


2489 


gaaagttctt 


gaagacgatg 


aatcaaacca 


ctgettaget 


ttcatgttcg 


tgaattggtc 


2549 


gttcgtttga 












2559 



<210> 42 
<211> 753 
<212> PRT 

<213> Arabidopsis thai i ana 
<400> 42 

Met Met Met Val His Ser Met Ser Arg Asp Met Met Asn Arg Glu Ser 
15 10 15 

Pro Asp Lys Gly Leu Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu 
20 25 30 



Gin Val Glu Ala Leu Glu Arg Val Tyr Thr Glu Cys Pro Lys Pro Ser 
35 40 45 

Ser Leu Arg Arg Gin Gin Leu He Arg Glu Cys Pro He Leu Ser Asn 
50 55 60 

He Glu Pro Lys Gin He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 75 80 

Glu Lys Gin Arg Lys Glu Ala Ala Arg Leu Gin Thr Val Asn Arg Lys 
85 90 95 
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Leu Asn Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

Lys Gin Val Ser Asn Leu Val Tyr Glu Asn Gly His Met Lys His Gin 

115 120 125 ,-. 

Leu His Thr Ala Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val 
130 135 140 

Val Val Ser Gly Gin Gin His Gin Gin Gin Asn Pro Asn Pro Gin His 
145 150 155 160 

Gin Gin Arg Asp Ala Asn Asn- Pro Ala Gly Leu Leu Ser lie Ala Glu 
. 165' 170 175 

Glu Ala Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr Ala Val Asp 
180 185 190 

Trp Val Gin Met He Gly Met Lys Pro Gly Pro. Asp Ser He Gly He 
195 200 205 

Val Ala He Ser Arg Asn Cys Ser Gly He Ala Ala Arg Ala Cys Gly 
210 215 220 

Leu Val Ser Leu Glu Pro Met Lys Val Ala Glu He Leu Lys Asp Arg 
225 230 235 240 

Pro Ser Trp Leu Arg Asp Cys Arg Ser Val Asp Thr Leu Ser Val He 
245 250 255 

Pro Ala Gly Asn Gly Gly Thr He Glu Leu He Tyr Thr Gin Met Tyr 
260 265 270 

Ala Pro Thr Thr Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr 
275 280 285 

Ser Thr Cys Leu Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu 
290 , 295 300 

Thr Ser Ala Thr Gly Gly Pro Thr Gly Pro Pro Ser Ser Asn Phe Val 
305 310 315 320 

Arg Ala Glu Met Lys Pro Ser Gly Phe Leu He Arg Pro Cys Asp Gly 
325 330 335 

Gly Gly Ser He Leu His He Val Asp His Val Asp Leu Asp Ala Trp 
340 345 350 

Ser Val Pro Glu Val Met Arg Pro Leu Tyr Glu Ser Ser Lys He Leu 
355 360 365 

Ala Gin Lys Met Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala 
370 375 380 

Gin Glu Thr Ser Gly Glu Val Gin Tyr Gly Gly Gly Arg Gin Pro Ala 
385 390 395 400 
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Val Leu Arg Thr Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala 
405 410 415 

Val Asn Gly Phe Val Asp Asp Gly Trp Ser Pro Met Gly Ser Asp Gly 
420 425 430 

Ala Glu Asp Val Thr Val Met He Asn Leu Ser Pro Gly Lys Phe Gly 
435 440 445 

Gly Ser Gin Tyr Gly Asn Ser Phe Leu Pro Ser Phe Gly Ser Gly Val 
450 455 460 

Leu Cys Ala Lys Ala- Ser -Met' Leu Leu Gin Asn Val Pro Pro Ala Val 
465 ' 470 475 480 

Leu Val Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Tyr Gly 
485 490 495 

Val Asp Ala Tyr Ala Ala Ala Ser Leu Arg Ala Ser Pro Phe Ala Val 
500 505 510 

Pro Cys Ala Arg Ala Gly Gly Phe Pro Ser Asn Gin Val He Leu Pro 
515 520 525 

Leu Ala Gin Thr Val Glu His Glu Glu Ser Leu Glu Val Val Arg Leu 
530 535 540 

Glu Gly His Ala Tyr Ser Pro Glu Asp Met Gly Leu Ala Arg Asp Met 
545 550 555 560 

Tyr Leu Leu Gin Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly 
565 570 575 

Cys Ala Gin Leu Val Phe Ala Pro He Asp Glu Ser Phe Ala Asp Asp 
580 585 590 

Ala Pro Leu Leu Pro Ser Gly Phe Arg He He Pro Leu Glu Gin Lys 
595 600 605 

Ser Thr Pro Asn Gly Ala Ser Ala Asn Arg Thr Leu Asp Leu Ala Ser 
610 615 620 

Ala Leu Glu Gly Ser Thr Arg Gin Ala Gly Glu Ala Asp Pro Asn Gly 
625 630 635 640 

Cys Asn Phe Arg Ser Val Leu Thr He Ala Phe Gin Phe Thr Phe Asp 
645 650 655 

Asn His Ser Arg Asp Ser Val Ala Ser Met Ala Arg Gin Tyr Val Arg ■ 
660 665 670 

Ser lie Val Gly Ser He Gin Arg Val Ala Leu Ala He Ala Pro Arg 
675 680 685 



Pro Gly Ser Asn He Ser Pro lie Ser Val Pro Thr Ser Pro Glu Ala 
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690 695 700 



Leu Thr Leu Val Arg Trp lie Ser Arg Ser Tyr Ser Leu His Thr Gly 
705 710 715 720 



Ala Asp Leu Phe Gly Ser Asp Ser Gin Thr Ser Gly Asp Thr Leu Leu 
725 730 735 



His Gin Leu Trp Asn His Ser Asp Ala lie Leu Cys Cys Ser Leu Lys 
740 745 750 



Thr 




<210> 


43 


<211> 


2526 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2526) 


<223> 


G390 


<400> 


43 



atg atg get cat cac tec atg gac gat aga gac tct cct gat aaa gga 48 
Met Met Ala His His Ser Met Asp Asp Arg Asp Ser Pro Asp Lys Gly 
15 10 15 

ttt gat tec ggc aag tac gtt aga tac acg ccg gaa caa gtt gaa get 96 
Phe Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala 
20 25 30 

ctt gag aga gtt tat get gag tgt cct aaa cct age tct ctg aga aga 144 
Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser Ser Leu Arg Arg 
35 40 45 

caa cag ctt att cgt gaa tgt ccc att etc tgt aac ate gag cct cga 192 
Gin Gin Leu lie Arg Glu Cys Pro lie Leu Cys Asn lie Glu Pro Arg 
50 55 60 

cag ate aaa gtt tgg ttc cag aat cgc aga tgt cga gag aag cag agg 240 
Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg 
65 70 75 80 

aaa gag tea get cgt ctt cag aca gtg aac agg aag ctg agt get atg 288 
Lys Glu Ser Ala Arg Leu Gin Thr Val Asn Arg Lys Leu Ser Ala Met 
85 90 95 

aac aag ctt ttg atg gaa gag aat gat cgt ttg cag aag caa gtc tec 336 
Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser 
100 105 110 

aac ttg gtt tat gag aat gga ttc atg aaa cat cga ate cac act get 3 84 

Asn Leu Val Tyr Glu Asn Gly Phe Met Lys His Arg lie His Thr Ala 
115 120 125 

tct ggg acg ace aca gac aac age tgt gag tct gtg gtc gtg agt ggt 432 
Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val Val Val Ser Gly 
130 135 140 

cag caa cgt cag cag caa aac cca aca cat cag cat cct cag cgt gat 4 80 

Gin Gin Arg Gin Gin Gin Asn Pro Thr His Gin His Pro Gin Arg Asp 
145 150 155 160 

gtt aac aac cca get aat ctt etc teg att gcg gag gag acc ttg gcg 528 
Val Asn Asn Pro Ala Asn Leu Leu Ser He Ala Glu Glu Thr Leu Ala 
165 170 175 
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gag etc ctt tgc aag get aca gga act get gtc gac tgg gtc cag atg 
Glu Phe Leu Cys Lys Ala Thr Gly Thr Ala Val Asp Trp Val Gin Met 
180 185 190 



576 



att ggg atg aag cct ggt ccg gat tct att ggt ate gta get gtt tea 
lie Gly Met Lys Pro Gly Pro Asp Ser He Gly He Val Ala Val Ser 
195 200 205 



624 



cgc aac tgc agt gga ata gca gca cgt gee tgt ggc etc gtg agt tta 
Arg Asn Cys Ser Gly He Ala Ala Arg Ala Cys Gly Leu Val Ser Leu 
210 215 220 



672 



gaa ccc atg aag gtc get gaa ate etc aaa gat cgt cca tct tgg ttc 
Glu Pro Met Lys Val Ala Glu He Leu Lys Asp. Arg Pro Ser Trp Phe 
225 230 235 240 



720 



cgt gac tgt cga tgt gtc gag act ctg aat gtt ata ccc act gga aat 
Arg Asp Cys Arg Cys Val Glu - Thr Leu Asn Val He Pro Thr Gly Asn 
245- * • 250 255 



768 



ggt ggt act ate gag ctt gtc aac act cag att tat get cct aca aca 
Gly Gly Thr He Glu Leu Val Asn Thr Gin lie Tyr Ala Pro Thr Thr 
260 265 270 



816 



tta gca gca get cgt gac ttt tgg acg ctg aga tat agt aca agt eta 
Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr Ser Thr Ser Leu 
275 280 285 



864 



gaa gat gga age tat gtg gtc tgt gag aga tea etc act tct gca act 
Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu Thr Ser Ala Thr 
290 295 300 



912 



ggt ggc ccc aat ggt cca ctt tct tea age ttc gtg aga gee aaa atg 
Gly Gly Pro Asn Gly Pro Leu Ser Ser Ser Phe Val Arg Ala Lys Met 
305 310 315 320 



960 



ctg tea age ggg ttt ctt ate cgt cct tgt gat ggt ggt ggt tec att 
Leu Ser Ser Gly Phe Leu He Arg Pro Cys Asp Gly Gly Gly Ser He 
325 330 335 



1008 



att cac ate gtt gat cat gtg gac ttg gat gtc tea agt gtt cct gaa 
He His He Val Asp His Val Asp Leu Asp Val Ser Ser Val Pro Glu 
340 345 350 



1056 



gtc etc agg cct ctt tat gag tct tec aaa ate ctt get caa aaa atg 
Val Leu Arg Pro Leu Tyr Glu Ser Ser Lys He Leu Ala Gin Lys Met 
355 360 365 



1104 



act gtc get get ctg aga cat gtg cgc caa att get caa gag act agt 
Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala Gin Glu Thr Ser 
370 375 380 



1152 



gga gaa gtc cag tat agt ggt gga cgc cag cct gca gtt tta agg act 
Gly Glu Val Gin Tyr Ser Gly Gly Arg Gin Pro Ala Val Leu Arg Thr 
385 390 395 400 



1200 



ttc age cag aga etc tgc egg ggt ttc aat gat get gta aat ggt ttt 
Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala Val Asn Gly Phe 
405 410 415 



1248 



gtc gat gat gga tgg tct cca atg agt agt gat gga gga gag gat att 
Val Asp Asp Gly Trp Ser Pro Met Ser Ser Asp Gly Gly Glu Asp He 
420 425 430 



1296 



acg ate atg att aac tct tec tct get aaa ttt get ggc tec caa tac 
Thr He Met He Asn Ser Ser Ser Ala Lys Phe Ala Gly Ser Gin Tyr 
435 440 445 



1344 



ggt age tea ttt ctt cca agt ttt gga agt ggt gtc etc tgt gee aaa 
Gly Ser Ser Phe Leu Pro Ser Phe Gly Ser Gly Val Leu Cys Ala Lys 
450 455 460 



1392 



get tct atg ctg ttg cag aat gtt cca ccc ctt gta ttg att egg ttc 
Ala Ser Met Leu Leu Gin Asn Val Pro Pro Leu Val Leu lie Arg Phe 
465 470 475 4B0 
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ctg aga gaa cac cga get gaa tgg gca gac tat ggt gtc gat gec tat 
Leu Arg Glu His Arg Ala Glu Trp Ala Asp Tyr Gly Val Asp Ala Tyr 
485 490 495 



1488 



tct get gca tct etc aga gca act cca tat get gtt cca tgc gtc aga 
Ser Ala Ala Ser Leu Arg Ala Thr Pro Tyr Ala Val Pro Cys Val Arg 
500 505 510 



1536 



acc ggt ggg ttc ccg agt aac caa gtc att ctt cct etc gca cag aca 
Thr Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro Leu Ala Gin Thr 
515 520 525 



1584 



etc gaa cat gaa gag ttt etc gaa gtg gtt aga ctt gga ggt cat get 
Leu Glu His Glu Glu Phe Leu Glu Val Val Arg Leu Gly Gly His Ala 
530 535 540 

tac tea cct gaa gac atg ggc tta tec egg gat atg tat tta ctg cag 
Tyr Ser Pro Glu Asp Met Gly Leu Ser Arg Asp Met Tyr Leu Leu Gin 
545 550 555 560 



1632 



1680 



ctt tgt age ggc gtt gat gaa aat gtg gtt gga ggt tgt get cag ctt 
Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly Cys Ala Gin Leu 

565 570 575 



1728 



gtc ttt gec cca ate gat gaa tea ttt get gat gat gca cct ttg ctt 
Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp Ala Pro Leu Leu 
580 585 590 



1776 



cct tct ggt ttc cgt gtc ata cca etc gac caa aaa aca aat ccg aat 
Pro Ser Gly Phe Arg Val lie Pro Leu Asp Gin Lys Thr Asn Pro Asn 
595 600 605 



1824 



gat cat caa tct gca agt cga aca egg gat eta gca teg tec eta gat 
Asp His Gin Ser Ala Ser Arg Thr Arg Asp Leu Ala Ser Ser Leu Asp 
610 615 620 



1872 



ggt tec acc aaa acc gat teg gaa aca aac tct aga ttg gtc tta aca 
Gly Ser Thr Lys Thr Asp Ser Glu Thr Asn Ser Arg Leu Val Leu Thr 
625 630 635 640 



1920 



ata gec ttc cag ttc acg ttt gat aac cat tec aga gac aat gtt get 
lie Ala Phe Gin Phe Thr Phe Asp Asn His Ser Arg Asp Asn Val Ala 
645 650 655 



1968 



aca atg gcg aga cag tat gtg agg aac gtt gtt ggt teg att cag aga 
Thr Met Ala Arg Gin Tyr Val Arg Asn Val Val Gly Ser lie Gin Arg 
660 665 670 



2016 



gtg get eta gee att acg cct cgt cct ggc tea atg caa ctt ccc act 
Val Ala Leu Ala He Thr Pro Arg Pro Gly Ser Met Gin Leu Pro Thr 
675 . 680 685 



2064 



tec cct gaa get etc act ctt gtc cgt tgg ate acc cgt agt tac agt 
Ser Pro Glu Ala Leu Thr Leu Val Arg Trp He Thr Arg Ser Tyr Ser 
690 695 700 



2112 



att cat aca ggt gca gat ctg ttt gga get gat tct cag tec tgt gga 
He His Thr Gly Ala Asp Leu Phe Gly Ala Asp Ser Gin Ser Cys Gly 
705 710 715 720 



2160 



gga gac aca ttg ctt aag caa etc tgg gac cat agt gat gec ata ttg 
Gly Asp Thr Leu Leu Lys Gin Leu Trp Asp His Ser Asp Ala He Leu 

725 730 735 



2208 



tgc tgc tec ctg aaa act aat gee tea ccg gta ttc aca ttt gca aac 
Cys Cys Ser Leu Lys Thr Asn Ala Ser Pro Val Phe Thr Phe Ala Asn 
740 745 750 



2256 



caa get ggt tta gac atg ctt gaa act aca ctt gtg gca ctt cag gat 
Gin Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp 
755 760 765 



2304 



ata atg etc gac aaa aca ctt gat gac tct ggt cgt aga get ctt tgc 
He Met Leu Asp Lys Thr Leu Asp Asp Ser Gly Arg Arg Ala Leu Cys 
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770 775 780 

tec gag ttc gec aag ate atg cag cag gga tat gcg aat ctt ccg gca 2400 
Ser Glu Phe Ala Lys lie Met Gin Gin Gly Tyr Ala Asn Leu Pro Ala 
785 790 795 800 

gga ata tgt gtg teg age atg ggc aga ccg gtt teg tat gag caa gcg 244 8 
Gly He Cys Val Ser Ser Met Gly Arg Pro Val Ser Tyr Glu Gin Ala 
805 810 815 

a cg gtg tgg aaa gtt gtt gat gac aac gaa tea aac cac tgc ttg get 24 96 
Thr Val Trp Lys Val Val Asp Asp Asn Glu Ser Asn His Cys Leu Ala 
820 825 830 

ttt ace etc gtt agt tgg teg ttt gtt tga 2526 
Phe Thr Leu Val Ser Trp Ser Phe Val 
835 840 

<210> 44 

<211> 841 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 44 

Met Met Ala His His Ser Met Asp Asp Arg Asp Ser Pro Asp Lys Gly 
1 5 10 15 

Phe Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala 
20 25 30 

Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser Ser Leu Arg Arg 
35 40 45 

Gin Gin Leu He Arg Glu Cys Pro He Leu CyB Asn He Glu Pro Arg 
50 55 60 

Gin He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg 
65 .70 75 80 

Lys Glu Ser Ala Arg Leu Gin Thr Val Asn Arg Lys Leu Ser Ala Met 
85 90 95 

Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser 
100 105 110 

Asn Leu Val Tyr Glu Asn Gly Phe Met Lys His Arg He His Thr Ala 
115 120 125 

Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val Val Val Ser Gly 
130 135 140 

Gin Gin Arg Gin Gin Gin Asn Pro Thr His Gin His Pro Gin Arg Asp 
145 150 155 160 

Val Asn Asn Pro Ala Asn Leu Leu Ser He Ala Glu Glu Thr Leu Ala 
165 170 175 

Glu Phe Leu Cys Lys Ala Thr Gly Thr Ala Val Asp Trp Val Gin Met 
180 185 190 



He Gly Met Lys Pro Gly Pro Asp Ser 



He Gly He Val Ala Val Ser 
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195 200 205 

Arg Asn Cys Ser Gly lie Ala Ala Arg Ala Cys Gly Leu Val Ser Leu 
210 215 220 

Glu Pro Met Lys Val Ala Glu lie Leu Lys Asp Arg Pro Ser Trp Phe 
225 230 235 240 

Arg Asp Cys Arg Cys Val Glu Thr Leu Asn Val lie Pro Thr Gly Asn 
245 250 255 

Gly Gly Thr lie Glu Leu Val Asn Thr Gin lie Tyr Ala Pro Thr Thr 
260 265 270 

Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr Ser Thr Ser Leu 
275 280 285 

Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu Thr Ser Ala Thr 
290 295 300 

Gly Gly Pro Asn Gly Pro Leu Ser Ser Ser Phe Val Arg Ala Lys Met 
305 310 315 320 

Leu Ser Ser Gly Phe Leu lie Arg Pro Cys Asp Gly Gly Gly Ser lie 
325 330 335 

lie His lie Val Asp His Val Asp Leu Asp Val Ser Ser Val Pro Glu 
340 345 350 

Val Leu Arg Pro Leu Tyr Glu Ser Ser Lys He Leu Ala Gin Lys Met 
355 360 365 

Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala Gin Glu Thr Ser 
370 375 380 

Gly Glu Val Gin Tyr Ser Gly Gly Arg Gin Pro Ala Val Leu Arg Thr 
385 390 395 400 

Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala Val Asn Gly Phe 
405 410 415 

Val Asp Asp Gly Trp Ser Pro Met Ser Ser Asp Gly Gly Glu Asp He 
420 425 430 

Thr He Met He Asn Ser Ser Ser Ala Lys Phe Ala Gly Ser Gin Tyr 
435 440 445 

Gly Ser Ser Phe Leu Pro Ser Phe Gly Ser Gly Val Leu Cys Ala Lys 
450 455 460 

Ala Ser Met Leu Leu Gin Asn Val Pro Pro Leu Val Leu He Arg Phe 
465 470 475 480 

Leu Arg Glu His Arg Ala Glu Trp Ala Asp Tyr Gly Val Asp Ala Tyr 
485 490 495 
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Ser Ala Ala Ser Leu Arg Ala Thr Pro Tyr Ala Val Pro Cys Val Arg 
500 505 510 

Thr Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro Leu Ala Gin Thr 
515 520 525 

Leu Glu His Glu Glu Phe Leu Glu Val Val Arg Leu Gly Gly His Ala 
530 535 540 

Tyr Ser Pro Glu Asp Met Gly Leu Ser Arg Asp Met Tyr Leu Leu Gin 
545 550 555 560 

Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly Cys Ala Gin Leu 
565 - 570 575 

Val Phe "Ala Pro He Asp Glu Ser Phe Ala Asp Asp Ala Pro Leu Leu 
580 585 590 

Pro Ser Gly Phe Arg Val He Pro Leu Asp Gin Lys Thr Asn Pro Asn 
595 600 605 

Asp His Gin Ser Ala Ser Arg Thr Arg Asp Leu Ala Ser Ser Leu Asp 
610 615 620 

Gly Ser Thr Lys Thr Asp Ser Glu Thr Asn Ser Arg Leu Val Leu Thr 
625 630 635 640 

He Ala Phe Gin Phe Thr Phe Asp Asn His Ser Arg Aap Asn Val Ala 
645 650 655 

Thr Met Ala Arg Gin Tyr Val Arg Asn Val Val Gly Ser He Gin Arg 
660 665 670 

Val Ala Leu Ala He Thr Pro Arg Pro Gly Ser Met Gin Leu Pro Thr 
675 680 685 

Ser Pro Glu Ala Leu Thr Leu Val Arg Trp He Thr Arg Ser Tyr Ser 
690 695 700 

He His Thr Gly Ala Asp Leu Phe Gly Ala Asp Ser Gin Ser Cys Gly 
705 710 715 720 

Gly Asp Thr Leu Leu Lys Gin Leu Trp Asp His. Ser Asp Ala He Leu 
725 730 735 

Cys Cys Ser Leu Lys Thr Asn Ala Ser Pro Val Phe Thr Phe Ala Asn 
740 745 750 

Gin Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp 
755 760 765 

He Met Leu Asp Lys Thr Leu Asp Asp Ser Gly Arg Arg Ala Leu Cys 
770 775 780 

Ser Glu Phe Ala Lys He Met Gin Gin Gly Tyr Ala Asn Leu Pro Ala 
785 790 795 800 
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Gly lie Cys Val Ser Ser Met Gly Arg Pro Val Ser Tyr Glu Gin Ala 
805 810 815 

Thr Val Trp Lys Val Val Asp Asp Asn Glu Ser Asn His Cys Leu Ala 
R?a 825 fnn 



Phe Thr Leu Val Ser Trp Ser Phe Val 

840 





835 


<210> 


45 


<211> 


2511 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2511) 


<223> 


G1548 


<400> 


45 



atg gca atg tct tgc aag gat ggt aag ttg gga tgt ttg gat aat ggg 

Met Ala Met Ser Cys Lys Asp Gly Lys Leu Gly Cys Leu Asp Asn Gly 

1 5 10 15 

aag tat gtg agg tat aca cct gaa caa gtt gaa gca ctt gag agg ctt 

Lya Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala Leu Glu Arg Leu 
20 25 30 

tat cat gac tgt cct aaa ccg agt tct att cgc cgt cag cag ttg ate 

Tyr His Asp Cys Pro Lys Pro Ser Ser lie Arg Arg Gin Gin Leu lie 
35 40 45 

aga gag tgt cct att etc tct aac att gag cct aaa cag ate aaa gtg 

Arg Glu Cys Pro lie Leu Ser Asn lie Glu Pro Lys Gin lie Lys Val 
50 55 60 

tgg ttt cag aac cga aga tgt aga gag aaa caa agg aaa gag get tea 

Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg Lys Glu Ala Ser 

65 70 75 80 

egg ctt caa get gtg aat egg aag ttg acg gca atg aac aag etc ttg 

Arg Leu Gin Ala Val Asn Arg Lys Leu Thr Ala Met Asn Lys Leu Leu 

85 90 95 

atg gag gag aat gac agg ttg cag aag caa gtg tea cag ctg gtc cat 

Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser Gin Leu Val His 
100 105 110 

gaa aac age tac ttc cgt caa cat act cca aat cct tea etc cca get 

Glu Asn Ser Tyr Phe Arg Gin His Thr Pro Asn Pro Ser Leu Pro Ala 
115 120 125 

aaa gac aca age tgt gaa teg gtg gtg acg agt ggt cag cac caa ttg 

Lys Asp Thr Ser Cys Glu Ser Val Val Thr Ser Gly Gin His Gin Leu 
130 135 140 

gca tct caa aat cct cag aga gat get agt cct gca gga ctt ttg tec 

Ala Ser Gin Asn Pro Gin Arg Asp Ala Ser Pro Ala Gly Leu Leu Ser 

145 150 155 160 

att gca gaa gaa act tta gca gag ttt ctt tea aag gca act gga ace 

lie Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 

165 170 175 

get gtt gag tgg gtt cag atg cct gga atg aag cct ggt ccg gat tec 

Ala Val Glu Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 190 

att gga ate ate get att tct cat ggt tgc act ggt gtg gca gca cgc 

He Gly He He Ala He Ser His Gly Cys Thr Gly Val Ala Ala Arg 
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gcc tgt ggc eta gtg ggt ctt gag cct aca agg gtt gca gag att gtc 672 
Ala Cys Gly Leu Val Gly Leu Glu Pro Thr Arg Val Ala Glu lie Val 
210 215 220 

aag gat cgt cct teg tgg ttc cgc gaa tgt cga get gtt gaa gtt atg _ 720 
Lys Asp Arg Pro Ser Trp Phe Arg Glu Cys Arg Ala Val Glu Val Met 
225 230 235 240 

aac gtg ttg cca act gcc aat ggt gga acc gtt gag ctg ctt tat atg 768 
Asn Val Leu Pro Thr Ala Asn Gly Gly Thr Val Glu Leu Leu Tyr Met 
245 250 255 

cag etc tat gca cca act aca ttg gcc cca cca cgc gat ttc tgg ctg 816 
Gin Leu Tyr Ala Pro Thr Thr Leu Ala Pro Pro Arg Asp Phe Trp Leu 
260 265 270 

tta cgt tac acc tct, gtt tta gaa gat ggc age ctt gtg gtg tgc gag 864 
Leu Arg Tyr Thr Ser Val Leu Glu Asp Gly Ser Leu Val Val Cys Glu 
275 280 285 

aga tct ctt aag age act caa aat ggt cct agt atg cca ctg gtt cag 912 
Arg Ser Leu Lys Ser Thr Gin Asn Gly Pro Ser Met Pro Leu Val Gin 
290 295 300 

aat ttt gtg aga gca gag atg ctt tec agt ggg tac ttg ata egg cct 960 
Asn Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu He Arg Pro 
305 310 315 320 

tgt gat ggt ggt ggc tea ate ata cac ata gtg gat cat atg gat ttg 1008 
Cys Asp Gly Gly Gly Ser He He His He Val Asp His Met Asp Leu 
325 330 335 

gag get tgt age gtg cct gag gtc ttg cgc ccg etc tat gag tea ccc 1056 
Glu Ala Cys Ser Val Pro Glu Val Leu Arg Pro Leu Tyr Glu Ser Pro 
340 345 350 

aaa gta ctt gca cag aag aca aca atg gcg gca ctg cgt cag etc aag 1104 
Lys Val Leu Ala Gin Lys Thr Thr Met Ala Ala Leu Arg Gin Leu Lys 
355 360 365 

caa ata get cag gag gtt act cag act aat agt agt gtt aat ggg tgg 1152 
Gin He Ala Gin Glu Val Thr Gin Thr Asn Ser Ser Val Asn Gly Trp 
370 375 380 

gga egg cgt cct get gcc tta aga get etc age cag agg eta age aga 1200 
Gly Arg Arg Pro Ala Ala Leu Arg Ala Leu Ser Gin Arg Leu Ser Arg 
385 390 395 400 

ggc ttc aat gaa get gta aat ggt ttc act gat gaa gga tgg tea gtg 1248 
Gly Phe Asn Glu Ala Val Asn Gly Phe Thr Asp Glu Gly Trp Ser Val 
405 410 415 

ata gga gat age atg gat gat gtc aca ate act gta aac tct tct cca 1296 
lie Gly Asp Ser Met Asp Asp val Thr lie Thr Val Asn Ser Ser Pro 
420 425 430 

gac aag eta atg ggt eta aat ctt aca ttt gcc aat ggc ttt get cct 1344 
Asp Lys Leu Met Gly Leu Asn Leu Thr Phe Ala Asn Gly Phe Ala Pro 
435 440 445 

gta age aat gtt gtt tta tgc gca aaa gca tea atg ctt tta cag aat 1392 
Val Ser Asn Val Val Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn 
450 . 455 460 

gtt cct ccg gcg ate ctg ctt egg ttt ctg agg gag cat agg tea gaa 1440 
Val Pro Pro Ala He Leu Leu Arg Phe Leu Arg Glu His Arg Ser Glu 
465 470 475 480 

tgg get gac aac aac att gat gcg tat eta gca gca gca gtt aaa gta 1488 
Trp Ala Asp Asn Asn lie Asp Ala Tyr Leu Ala Ala Ala Val Lys Val 
485 490 495 

ggg cct tgt agt gcc cga gtt gga gga ttt gga ggg cag gtt ata ctt 1536 
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Gly Pro Cys Ser Ala Arg Val Gly Gly Phe Gly Gly Gin Val He Leu 
500 505 510 

cca ctt get cat act att gag cat gaa gag ttt atg gaa gtc ate aaa 1584 
Pro Leu Ala His Thr He Glu His Glu Glu Phe Met Glu Val He Lys 
515 520 525 

ttg gaa ggt ctt ggt cat tec cct gaa gat gca ate gtt cca aga gat 1632 
Leu Glu Gly Leu Gly His Ser Pro Glu Asp Ala He Val Pro Arg Asp 
530 535 540 

ate ttc ctt ctt caa ctt tgt age gga atg gat gaa aat get gta gga 1680 
He Phe Leu Leu Gin Leu Cys Ser Gly Met Asp Glu Asn Ala Val Gly 
545 550 555 560 

ace tgt gcg gaa ctt ata ttt get cca ate gat get teg ttt gcg gat 1728 
Thr Cys Ala Glu Leu He Phe Ala Pro He Asp Ala Ser Phe Ala Asp 
565 570 575 

gat gca cct' ctg ctt cct tct ggt ttt cgt att ate cct ctt gat tec 1776 
Asp Ala Pro Leu Leu Pro Ser Gly Phe Arg He He Pro Leu Asp Ser 
580 585 590 

gca aag gaa gta tct age cca aac cga acc ttg gat ctt get teg gca 1824 
Ala Lys Glu Val Ser Ser Pro Asn Arg Thr Leu Asp Leu Ala Ser Ala 
595 600- 605 

ctg gaa att ggt tea get gga aca aaa gee tea act gat caa tea gga 1872 
Leu Glu lie Gly Ser Ala Gly Thr Lys Ala Ser Thr Asp Gin Ser Gly 
610 615 620 

aac tec aca tgt gca aga tct gtg atg aca ata gca ttt gag ttt ggt 1920 
Asn Ser Thr Cys Ala Arg Ser Val Met Thr He Ala Phe Glu Phe Gly 
625 630 635 640 

ate gag age cat atg caa gaa cat gta gca tec atg get agg cag tat 1968 
He Glu Ser His Met Gin Glu His Val Ala Ser Met Ala Arg Gin Tyr 
645 650 655 

gtt cga ggt ate ata tea teg gtg cag aga gta gca ttg get ctt tct 2016 
Val Arg Gly He He Ser Ser Val Gin Arg Val Ala Leu Ala Leu Ser 
660 665 670 

cct tct cat ate age tea caa gtt ggt eta cgc act cct ttg ggt act 2064 
Pro Ser His He Ser Ser Gin Val Gly Leu Arg Thr Pro Leu Gly Thr 
675 680 685 

cct gaa gec caa aca ctt get cgt tgg att tgc cag agt tac agg ggc 2112 
Pro Glu Ala Gin Thr Leu Ala Arg Trp lie Cys Gin Ser Tyr Arg Gly 
690 695 700 

tac atg ggt gtt gag eta ctt aaa tea aac agt gac ggc aat gaa tct 2160 
Tyr Met Gly Val Glu Leu Leu Lys Ser Asn Ser Asp Gly Asn Glu Ser 
705 710 715 720 

att ctt aag aat ctt tgg cat cac act gat get ata ate tgc tgc tea 2208 
He Leu Lys Asn Leu Trp His His Thr Asp Ala He He Cys Cys Ser 
725 730 735 

atg aag gee ttg ccc gtc ttc aca ttt gca aac cag gcg gga ctt gac 2256 
Met Lys Ala Leu Pro Val Phe Thr Phe Ala Asn Gin Ala Gly Leu Asp 
740 745 750 

atg ctg gag act aca tta gtt get ctt caa gac ate tct tta gag aag 2304 
Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Ser Leu Glu Lys 
755 760 765 

ata ttt gat gac aat gga aga aag act ctt tgc tct gag ttc cca cag 2352 
He Phe Asp Asp Asn Gly Arg Lys Thr Leu Cys Ser Glu Phe Pro Gin 
770 775 780 - 

ate atg caa cag ggc ttc gcg tgc ctt caa ggc ggg ata tgt etc tea 2400 
lie Met Gin Gin Gly Phe Ala Cys Leu Gin Gly Gly He Cys Leu Ser 
785 790 795 800 
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age atg ggg aga cca gtt teg tat gag aga gca gtt get tgg aaa gta 2448 
Ser Met Gly Arg Pro Val Ser Tyr Glu Arg Ala Val Ala Trp Lys Val 
805 810 815 

etc aat gaa gaa gaa aat get cat tgc ate tgc ttt gtg ttc ate aat 2496 
Leu Asn Glu Glu Glu Asn Ala His Cys lie Cys Phe Val Phe lie Asn 
820 825 830 

2511 



tgg tec ttt gtg tga 


Trp Ser Phe val 




835 


<210> 


46 


<211> 


836 


<212> 


PRT 


<213> 


Arabidopsis thaliana 


<400> 


46 -. ■ ■ * - 



Met Ala Met Ser Cys Lys Asp Gly Lys Leu Gly Cys Leu Asp Asn Gly 
15 10 15 



Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala Leu Glu Arg Leu 
20 25 30 



Tyr His Asp Cys Pro Lys Pro Ser Ser lie Arg Arg Gin Gin Leu lie 
35 40 45 



Arg Glu Cys Pro lie Leu Ser Asn lie Glu Pro Lys Gin lie Lys Val 
50 55 60 



Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg Lys Glu Ala Ser 
65 70 75 80 



Arg Leu Gin Ala Val Asn Arg Lys Leu Thr Ala Met Asn Lys Leu Leu 
85 90 95 



Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser Gin Leu Val His 
100 105 110 



Glu Asn Ser Tyr Phe Arg Gin His Thr Pro Asn Pro Ser Leu Pro Ala 
115 120 125 



Lys Asp Thr Ser Cys Glu Ser Val Val Thr Ser Gly Gin His Gin Leu 
130 135 140 



Ala Ser Gin Asn Pro Gin Arg Asp Ala Ser Pro Ala Gly Leu Leu Ser 
145 150 155 160 



lie Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 
165 170 175 



Ala Val Glu Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 190 



lie Gly He He Ala He Ser His Gly Cys Thr Gly Val Ala Ala Arg 
195 200 205 



Ala Cys Gly Leu Val Gly Leu Glu Pro Thr Arg Val Ala Glu He Val 
210 215 220 
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Lys Asp Arg Pro Ser Trp Phe Arg Glu Cys Arg Ala Val Glu Val Met 
225 230 235 240 

Asn Val Leu Pro Thr Ala Asn Gly Gly Thr Val Glu Leu Leu Tyr Met 

245 250 255 

Gin Leu Tyr Ala Pro Thr Thr Leu Ala Pro Pro Arg Asp Phe Trp Leu 
260 265 270 

Leu Arg Tyr Thr Ser Val Leu Glu Asp Gly Ser Leu Val Val Cys Glu 
275 280 285 

Arg Ser Leu Lys Ser Thr Gin Asn Gly Pro Ser Met Pro Leu Val Gin 
290 295 300 

Asn Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu lie Arg Pro 
305 310 315 320 

Cys Asp Gly Gly Gly Ser lie He His He Val Asp His Met Asp Leu 

325 330 335 

Glu Ala Cys Ser Val Pro Glu Val Leu Arg Pro Leu Tyr Glu Ser Pro 
340 345 350 

Lys Val Leu Ala Gin Lys Thr Thr Met Ala Ala Leu Arg Gin Leu Lys 
355 360 365 

Gin He Ala Gin Glu Val Thr Gin Thr Asn Ser Ser Val Asn Gly Trp 
370 .375 380 

Gly Arg Arg Pro Ala Ala Leu Arg Ala Leu Ser Gin Arg Leu Ser Arg 
385 390 395 400 

Gly Phe Asn Glu Ala Val Asn Gly Phe Thr Asp Glu Gly Trp Ser Val 
405 410 415 

He Gly Asp Ser Met Asp Asp Val Thr He Thr Val Asn Ser Ser Pro 
420 425 ' 430 

Asp Lys Leu Met Gly Leu Asn Leu Thr Phe Ala Asn Gly Phe Ala Pro 
435 440 445 

Val Ser Asn Val Val Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn 
450 455 460 

Val Pro Pro Ala He Leu Leu Arg Phe Leu Arg Glu His Arg Ser Glu 
465 470 475 480 

Trp Ala Asp Asn Asn He Asp Ala Tyr Leu Ala Ala Ala Val Lys Val 
485 490 495 

Gly Pro Cys Ser Ala Arg Val Gly Gly Phe Gly Gly Gin Val He Leu 
500 505 510 

Pro Leu Ala His Thr He Glu His Glu Glu Phe Met Glu Val He Lys 
515 520 525 
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Leu Glu Gly Leu Gly His Ser Pro Glu Asp Ala lie val Pro Arg Asp 
530 535 540 



lie Phe Leu Leu Gin Leu Cys Ser Gly Met Asp Glu Asn Ala Val Gly 
545 550 555 560 



Thr Cys Ala Glu Leu lie Phe Ala Pro lie Asp Ala Ser Phe Ala Asp 
565 570 S75 



Asp Ala Pro Leu Leu Pro Ser Gly Phe Arg lie lie Pro Leu Asp Ser 
580 585 590 



Ala Lys Glu Val Ser Ser. Pro- Asn Arg Thr Leu Asp Leu Ala Ser Ala 
595" 600 605 

Leu Glu He Gly Ser Ala Gly Thr Lys Ala Ser Thr Asp Gin Ser Gly 
610 615 620 

Asn Ser Thr Cys Ala Arg Ser Val Met Thr lie Ala Phe Glu Phe Gly 
625 630 635 640 

He Glu Ser His Met Gin Glu His Val Ala Ser Met Ala Arg Gin Tyr 
645 650 655 

Val Arg Gly lie He Ser Ser Val Gin Arg Val Ala Leu Ala Leu Ser 
660 665 670 

Pro Ser His He Ser Ser Gin Val Gly Leu Arg Thr Pro Leu Gly Thr 
675 680 685 

Pro Glu Ala Gin Thr Leu Ala Arg Trp He Cys Gin Ser Tyr Arg Gly 
690 695 700 

Tyr Met Gly Val Glu Leu Leu Lys Ser Asn Ser Asp Gly Asn Glu Ser 
705 710 715 720 

He Leu Lys Asn Leu Trp His His ' Thr Asp Ala He He Cys Cys Ser 
725 730 735 

Met Lys Ala Leu Pro Val Phe Thr Phe Ala Asn Gin Ala Gly Leu Asp 
740 745 750 



Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Ser Leu Glu Lys 
755 760 765 

He Phe Asp Asp Asn Gly Arg Lys Thr Leu Cys Ser Glu Phe Pro Gin 
770 775 780 

He Met Gin Gin Gly Phe Ala Cys Leu Gin Gly Gly He Cys Leu Ser 
785 790 795 800 

Ser Met Gly Arg Pro Val Ser Tyr Glu Arg Ala Val Ala Trp Lys Val 
805 810 815 



Leu Asn Glu Glu Glu Asn Ala His Cys He Cys Phe Val Phe He Asn 
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820 825 830 



Trp Ser Phe Val 
835 
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